(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
30 May 2002 (30.05.2002) 




PCT 



nun ii mi iiiriiiiiMiiiiiiii 

(10) International Publication Number 

WO 02/42461 A2 



(51) International Patent Classification 7 : 

C07K 14/705 



C12N 15/12, 



(21) International Application Number: PCT/US0 1/44386 

(22) International Filing Date: 

26 November 2001 (26.11.2001) 



(25) Filing. language: 

(26) Publication Language: 



(30) 



Priority Data: 

60/253,404 
60/255,366 
60/270,266 
60/270,286 
60/282,032 
60/282,356 
60/282,358 
60/282,365 
60/290,917 
60/309,208 



27 November 
12 December 
20 February 
20 February 
6 April 
6 April 
6 April 
6 April 
14 May 
31 July 



2000(27.11. 
2000(12.12. 
2001 (20.02. 
2001 (20.02, 
2001 (06.04. 
2001 (06.04. 
2001 (06.04. 
2001 (06.04. 
2001 (14.05. 
2001 (31.07. 



English 
English 



2000) US 
.2000) US 
.2001) US 
.2001) US 
.2001) US 
,2001) US 

2001) US 
2001) US 
2001) US 
2001) US 



(71) Applicant (for all designated States except US): ARENA 
PHARMACEUTICALS, INC. [US/US]; 6166 Nancy 
Drive, San Diego, CA 92121 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): CHEN, Ruoping 

[CN/USJ; 5296 Timber Branch Way, San Diego, CA 
92150 (US). CHU, Zhi, Liang [CN/US]; 11489 Creek- 
stone Lane, San Diego, CA 92128 (US). DANG, Huong, 
T. [VN/US]; 5352 Oak Park Drive, San Diego, CA 92105 
(US). LOWITZ, Kevin, P. [US/US]; 8031 Caminito de 
Pizza, Apt. C, San Diego, CA 92108 (US). PRIDE, 
Cameron [US/US]; 11026 West Ocean Air Drive #1128, 
San Diego, CA 92130 (US). 

(74) Agents: STRAHER, Michael, P. et al.; Woodcock Wash- 
burn LLP, 46th floor, One Liberty Place, Philadelphia, PA 
19103 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FT, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KB, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 

[Continued on next page] 



(54) Title: ENDOGENOUS AND NON-ENDOGENOUS VERSIONS OF HUMAN G PROTEIN-COUPLED RECEPTORS 



IP3 Assay in 293 Cells 



< 



is 



700 



^ 600- 



£ 

c 
o 



500- 



400. 



300- 



£ 200. 
100. 



0. 



pCMV 



RUP35 



RUP36 



(57) Abstract: The invention disclosed in this patent document relates to transmembrane receptors, more particularly to a human 
G protein-coupled receptor for which the endogenous ligand is unknown, and to mutated (non-endogenous) versions of the human 
GPCRs for evidence of constitutive activity. 



WU UZ/42401 A2 iniiiiiiiiiiiiiiiiiiiiiiiiiiniiunniiiiiiiNiiiuiiiiiiiii 



MX, MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, 
SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, 
ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG, CT, CM, GA, GN, GQ, GW, ML, MR, 
NE, SN, TD, TG). 



Declaration under Rule 4.17: 

— of inventorship (Rule 4. 1 7(iv)) for US only 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



5 



ENDOGENOUS AND NON-ENDOGENOUS VERSIONS OF 
HUMAN G PROTEIN-COUPLED RECEPTORS 



CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Serial Number 09/170,496, filed 
with the United States Patent and Trademark Office on October 13, 1998 and its 

10 corresponding PCT application number PCT/US99/23938, published as WO 00/22129 
on April 20, 2000. This document claims the benefit of priority from the following 
provisional applications, all filed via U.S.- Express Mail with the United States Patent and 
Trademark Office on the indicated dates: U.S. Provisional Number 60/253,404, filed 
November 27, 2000; U.S. Provisional Number 60/255,366, filed December 12, 2000; 

15 U.S. Provisional Number 60/270,286 filed February 20, 2001; U.S. Provisional Number 
60/282,356, filed April 6, 2001, which claims priority from U.S. Provisional Number 
60/270,266, filed February 20, 2001; U.S. Provisional Number 60/282,032, filed April 6, 
2001; U.S. Provisional Number 60/282,358, filed April 6, 2001; U. S. Provisional 
Number 60/282,365, filed April 6, 2001; U.S. Provisional Number 60/290,917, filed 

20 May 14, 2001; U.S. Provisional Number 60/309,208, filed July 31, 2001; the disclosures 
of which are incoiporated in their entirety by reference. 

FIELD OF THE INVENTION 

25 The present invention relates to transmembrane receptors, in some embodiments to 

G proteinrcoupled receptors and, in some preferred embodiments, to endogenous GPCRs 
that are altered to establish or enhance constitutive activity of the receptor. In some 
embodiments, the constitutively activated GPCRs will be used for the direct identification 
of candidate compounds as receptor agonists or inverse agonists having applicability as 

30 therapeutic agents. 
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BACKGROUND OF THE INVENTION 



PATENT 
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Although a number of receptor classes exist in humans, by far the most abundant 
and therapeutically relevant is represented by the G protein-coupled receptor (GPCR) class. 
It is estimated that there are some 30,000-40,000 genes within the human genome, and of 
these, approximately 2% are estimated to code for GPCRs. Receptors, including GPCRs, 
for which the endogenous ligand has been identified, are referred to as "known" receptors, 
while receptors for which the endogenous ligand has not been identified are referred to as 
"orphan" receptors. 

GPCRs represent an important area for the development of pharmaceutical 
products: from approximately 20 of the 100 known GPCRs, approximately 60% of all 
prescription pharmaceuticals have been developed. For example, in 1999, of the top 100 
brand name prescription drugs, the following drugs interact with GPCRs (diseases and/or 
disorders treated are indicated in parentheses): 



Claririn® (allergies) 
Paxil® (depression) 
Cozaar® (hypertension) 
Propulsid® (reflux disease) 
Pepcid® (reflux) 
20 EfFexor® (depression) 
Allegra® (allergies) 
Diprivan® (anesthesia) 
Hytrin® (hypertension) 
Plavix® (Ml/stroke) 
Xalatan® (glaucoma) 
Harnal® (prostatic hyperplasia) 



25 



Prozac® (depression) 
Zoloft® (depression) 
Imitrex® (migraine) 
Risperdal® (schizophrenia) 
Gaster® (ulcers) 
Depakote® (epilepsy) 
Lupron® (prostate cancer) 
BuSpar® (anxiety) 
Wellbutrin® (depression) 
Toprol-XL® (hypertension) 
Singulair® (asthma) 



Vasotec® (hypertension) 
Zyprexa ® (psychotic disorder) 
Zantac® (reflux) 
Serevent® (asthma) 
Atrovent® (broncho spasm) 
Cardura® (prostatic hypertrophy) 
Zoladex® (prostate cancer) 
Ventolin® (bronchospasm) 
Zyrtec® (rhinitis) 
Tenormin® (angina) 
Diovan® (hypertension) 



(Med Ad News 1999 Data). 

GPCRs share a common structural motif, having seven sequences of between 22 to 
30 24 hydrophobic amino acids that form seven alpha helices, each of which spans the 
membrane (each span is identified by number, ie., transmembrane- 1 (TM-1), 
transmebrane-2 (TM-2), etc.). The transmembrane helices are joined by strands of amino 
acids between transmembrane-2 and transmembrane-3 , transmembrane-4 and 
transmembrane-5, and transmembrane-6 and transmembrane-7 on the exterior, or 
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"extracellular" side, of the cell membrane (these are referred to as "extracellular" 

regions 1, 2 and 3 (EC-1, EC-2 and EC-3), respectively). The transmembrane helices are 
also joined by strands of amino acids between transmembrane- 1 and transmembrane^, 
transmembrane-3 and transmembrane-4, and transmembrane- 5 and transmembrane-6 on the 
5 interior, or "intracellular" side, of the cell membrane (these are referred to as "intracellular" 
regions 1, 2 and 3 (IC-1, IC-2 and IC-3), respectively). The "carboxy" ("C") terminus of 
the receptor lies in the intracellular space within the cell, and the "amino" ("N") terminus of 
the receptor lies in the extracellular space outside of the cell. 

Generally, when an endogenous ligand binds with the receptor (often referred to as 

10 "activation" of the receptor), there is a change in the conformation of the intracellular region 
that allows for coupling between the intracellular region and an intracellular "G-protein." It 
has been reported that GPCRs are "promiscuous" with respect to G proteins, i.e., that a 
GPCR can interact with more than one G protein. See, Kenakin, T., 43 Life Sciences 1095 
(1988). Although other G proteins exist, currently, G q , G s , Gj, G z and G 0 are G proteins that 

15 have been identified. Ligand-activated GPCR coupling with the G-protein initiates a 
signaling cascade process (referred to as "signal transduction"). Under normal conditions, 
signal transduction ultimately results in cellular activation or cellular inhibition. Although 
not wishing to be bound to theory, it is thought that the IC-3 loop as well as the carboxy 
terminus of the receptor interact with the G protein. 

20 Under physiological conditions, GPCRs exist in the cell membrane in equilibrium 

between two different conformations: an "inactive" state and an "active" state. A receptor 
in an inactive state is unable to link to the intracellular signaling transduction pathway to 
initiate signal transduction leading to a biological response. Changing the receptor 
conformation to the active state allows linkage to the transduction pathway (via the G- 

25 protein) and produces a biological response. 

A receptor may be stabilized in an active state by a ligand or a compound such as a 
drug. Recent discoveries, including but not exclusively limited to modifications to the 
amino acid sequence of the receptor, provide means other than ligands or drugs to promote 
and stabilize the receptor in the active state conformation. These means effectively stabilize 

30 the receptor in an active state by simulating the effect of a ligand binding to the receptor. 
Stabilization by such ligand-independent means is termed "constitutive receptor activation." 
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SUMMARY OF THE INVENTION 

Disclosed herein are endogenous and non-endogenous versions of human GPCRs 

and uses thereof. 

Some embodiments of the present invention relate to a G protein-coupled 
5 receptor encoded by an amino acid sequence of SEQ.ED.NO.:2, non-endogenous, 
constitutively activated versions of the same, and host cells comprising the same- 
Some embodiments of the present invention relate to a plasmid comprising a 
vector and the cDNA of SEQ.ID.NO.rl and host cells comprising the same. 

Some embodiments of the present invention relate to a G protein-coupled 
10 receptor encoded by an amino acid sequence of SEQ.ID.NO.:4, non-endogenous, 
constitutively activated versions of the same, and host cells comprising the same.. 

Some embodiments of the present invention relate to a plasmid comprising a 
vector and the cDNA of SEQ.ID.NO.:3, non-endogenous, constitutively activated 
versions of the same, and host cells comprising the same. 
15 Some embodiments of the present invention relate to G protein-coupled receptor 

encoded by an amino acid sequence of SEQ.ID.NO.:6, non-endogenous, constitutively 
activated versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
vector and the cDNA of SEQ.ID.NO.:5 and host cells comprising the same. 
20 Some embodiments of the present invention relate to a G protein-coupled 

receptor encoded by an amino acid sequence of SEQ.ID.NO.:8, non-endogenous, 
constitutively activated versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
vector and the cDNA of SEQ.ID.NO.:7, non-endogenous, constitutively activated 
25 versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a G protein-coupled 
receptor encoded by an amino acid sequence of SEQ.ID.NO.:10, non-endogenous, 
constitutively activated versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
30 vector and the cDNA of SEQ.ID.NO.:9 and host cells comprising the same. 
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Some embodiments of the present invention relate to a G protein-coupled 

receptor encoded by an amino acid sequence of SEQ.ID.NO.:12, non-endogenous, 
constitutively activated versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
5 vector and the cDNA of SEQ.ID.NO. : 1 1 , and host cells comprising the same. 

Some embodiments of the present invention relate to a G protein-coupled 
receptor encoded by an amino acid sequence of SEQ.H).NO.:14, constitutively activated 
versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
10 vector and the cDNA of SEQ.ID.NO.: 13 and host cells comprising the same. 

Some embodiments of the present invention relate to a G protein-coupled 
receptor encoded by an amino acid sequence of SEQJD.NO.:16, constitutively activated 
versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
15 vector and the cDNA of SEQ.DD.NO.: 15 and host cells comprising the same. 

Some embodiments of the present invention relate to a G protein-coupled 
receptor encoded by an amino acid sequence of SEQ.ID.NO.:18, constitutively activated 
versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
20 vector and the cDNA of SEQ.ID.NO.: 17 and host cells comprising the same. 

Some embodiments of the present invention relate to a G protein-coupled 
receptor encoded by an amino acid sequence of SEQ.ID.NO. :20, constitutively activated 
versions of the same, and host cells comprising the same. 

Some embodiments of the present invention relate to a plasmid comprising a 
25 vector and the cDNA of SEQ.ID.NO.: 19 and host cells comprising the same. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a graphic representation of activation of RUP32, G q (del)/Gi, RUP32 
co-transfected with G q (del)/Gi, and CMV (control; expression vector) in a second 
30 messenger assay measuring the accumulation of inositol phosphate (IP3) utilizing 293 cells. 

Figure 2 provides an illustration of second messenger IP3 production from 
endogenous version RUP35 and RUP36 as compared with the control ("CMV"). 
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DETAILED DESCRIPTION 

The scientific literature that has evolved around receptors has adopted a number of 
terms to refer to ligands having various effects on receptors. For clarity and consistency, 
5 the following definitions will be used throughout this patent document. To the extent that 
these definitions conflict with other definitions for these terms, the following definitions 
shall control: 

AGONISTS shall mean materials (e.g., ligands, candidate compounds) that activate 
the intracellular response when they bind to the receptor, or enhance GTP binding to 
1 0 membranes. In some embodiments, AGONISTS are those materials not previously known 
to activate the intracellular response when they bind to the receptor or to enhance GTP 
binding to membranes. 

AMINO ACID ABBREVIATIONS used herein are set out in Table A: 



TABLE A 



ALANINE 


ALA 


A 


ARGIMNE 


ARG 


R 


ASPARAGINE 


ASN 


N 


ASPARTICACID 


ASP 


D 


CYSTEINE 


CYS 


C 


GLUTAMIC ACID 


GLU 


E 


GLUTAMINE 


GLN 


Q 


GLYCINE 


GLY 


G 


fflSTEDINE 


HIS 


H 


ISOLEUCINE 


ILE 


I 


LEUCINE 


LEU 


L 


LYSINE 


LYS 


K 
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METHIONINE 


MET 


M 


PHENYLALANINE 


PHE 


F 


PROLINE 


PRO 


P 


SERINE 


SER 


S 


THREONINE 


THR 


T 


TRYPTOPHAN 


TRP 


W 


TYROSINE 


TYR 


Y 


VALINE 


VAL 


V 



ANTAGONIST shall mean materials (eg., ligands, candidate compounds) that 
competitively bind to the receptor at the same site as the agonists but which do not activate 
the intracellular response initiated by the active form of the receptor, and can thereby inhibit 
5 the intracellular responses by agonists. ANTAGONISTS do not diminish the baseline 
intracellular response in the absence of an agonist. In some embodiments, 
ANTAGONISTS are those materials not previously known to activate the intracellular 
response when they bind to the receptor or to enhance GTP binding to membranes. 

CANDIDATE COMPOUND shall mean a molecule (for example, and not 
10 limitation, a chemical compound) that is amenable to a screening technique. Preferably, the 
phrase "candidate compound" does not include compounds which were publicly known to 
be compounds selected from the group consisting of inverse agonist, agonist or antagonist 
to a receptor, as previously determined by an indirect identification process ("indirectly 
identified compound"); more preferably, not including an indirectly identified compound 
1 5 which has previously been determined to have therapeutic efficacy in at least one mammal; 
and, most preferably, not including an indirectly identified compound which has previously 
been deteimined to have therapeutic utility in humans. 

COMPOSITION means a material comprising at least one component; a 
"pharmaceutical composition" is an example of a composition. 
20 COMPOUND EFFICACY shall mean a measurement of the ability of a 

compound to inhibit or stimulate receptor functionality; i.e. the ability to activate/inhibit a 
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signal transduction pathway, as opposed to receptor binding affinity. Exemplary means 

of detecting compound efficacy are disclosed in the Example section of this patent 
document. 

CODON shall mean a grouping of three nucleotides (or equivalents to nucleotides) 
5 which generally comprise a nucleoside (adenosine (A), guanosine (G), cytidine (C), uridine 
(U) and thymidine (T)) coupled to a phosphate group and which, when translated, encodes 
an amino acid. 

CONSTTTUTIVELY ACTIVATED RECEPTOR shall mean a receptor 
subjected to constitutive receptor activation. A constitutively activated receptor can be 
1 0 endogenous or non-endogenous. 

CONSTITUTIVE RECEPTOR ACTIVATION shall mean stabilization of a 
receptor in the active state by means other than binding of the receptor with its ligand or a 
chemical equivalent thereof. 

CONTACT or CONTACTING shall mean bringing at least two moieties together, 
1 5 whether in an in vitro system or an in vivo system. 

DIRECTLY IDENTIFYING or DIRECTLY IDENTIFIED, in relationship to 
the phrase "candidate compound", shall mean the screening of a candidate compound 
against a constitutively activated receptor, preferably a constitutively activated orphan 
receptor, and most preferably against a constitutively activated G protein-coupled cell 
20 surface orphan receptor, and assessing the compound efficacy of such compound. This 
phrase is, under no circumstances, to be interpreted or understood to be encompassed by or 
to encompass the phrase "indirectly identifying" or "indirectly identified." 

ENDOGENOUS shall mean a material that a mammal naturally produces. 
ENDOGENOUS in reference to, for example and not limitation, the term "receptor," shall 
25 mean that which is naturally produced by a mammal (for example, and not limitation, a 
human) or a virus. By contrast, the term NON-ENDOGENOUS in this context shall mean 
that which is not naturally produced by a mammal (for example, and not limitation, a 
human) or a virus. For example, and not limitation, a receptor which is not constitutively 
active in its endogenous form, but when manipulated becomes constitutively active, is most 
30 preferably referred to herein as a "non-endogenous, constitutively activated receptor." Both 
terms can be utilized to describe both "in vivo" and "in vitro" systems. For example, and 
not limitation, in a screening approach, the endogenous or non-endogenous receptor may be 
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in reference to an in vitro screening system. As a further example and not limitation, 

where the genome of a mammal has been manipulated to include a non-endogenous 
constitutively activated receptor, screening of a candidate compound by means of an in vivo 
system is viable, 

5 G PROTEIN COUPLED RECEPTOR FUSION PROTEIN and GPCR 

FUSION PROTEIN, in the context of the invention disclosed herein, each mean a non- 
endogenous protein comprising an endogenous, constitutively activate GPCR or a non- 
endogenous, constitutively activated GPCR fused to at least one G protein, most preferably 
the alpha (a) subunit of such G protein (this being the subunit that binds GTP), with the G 

10 protein preferably being of the same type as the G protein that naturally couples with 
endogenous orphan GPCR For example, and not limitation, in an endogenous state, if the 
G protein "G s ct" is the predominate G protein that couples with the GPCR, a GPCR Fusion 
Protein based upon the specific GPCR would be a non-endogenous protein comprising the 
GPCR fused to G s oc; in some circumstances, as will be set forth below, a non-predominant 

15 G protein can be fused to the GPCR The G protein can be fused directly to the C-terminus 
of the constitutively active GPCR or there may be spacers between the two. 

HOST CELL shall mean a cell capable of having a Plasmid and/or Vector 
incorporated therein. In the case of a prokaryotic Host Cell, a Plasmid is typically 
replicated as a autonomous molecule as the Host Cell replicates (generally, the Plasmid is 

20 thereafter isolated for introduction into a eukaryotic Host Cell); in the case of a eukaryotic 
Host Cell, a Plasmid is integrated into the cellular DNA of the Host Cell such that when the 
eukaryotic Host Cell replicates, the Plasmid replicates. In some embodiments the Host Cell 
is eukaryotic, more preferably, mammalian, and most preferably selected from the group 
consisting of 293, 293T and COS-7 cells. 

25 INDIRECTLY IDENTIFYING or INDIRECTLY IDENTIFIED means the 

traditional approach to the drug discovery process involving identification of an endogenous 
ligand specific for an endogenous receptor, screening of candidate compounds against the 
receptor for determination of those which interfere and/or compete with the ligand-receptor 
interaction, and assessing the efficacy of the compound for affecting at least one second 

30 messenger pathway associated with the activated receptor. 
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INHIBIT or INHIBITING, in relationship to the term "response" shall 

mean that a response is decreased or prevented in the presence of a compound as opposed to 

in the absence of the compound. 

INVERSE AGONISTS shall mean materials {e.g., ligand, candidate compound) 

5 which bind to either the endogenous form of the receptor or to the constitutively activated 

form of the receptor, and which inhibit the baseline intracellular response initiated by the 

active form of the receptor below the normal base level of activity which is observed in the 

absence of agonists, or decrease GTP binding to membranes. Preferably, the baseline 

intracellular response is inhibited in the presence of the inverse agonist by at least 30%, at 

10 least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, 

at least 92%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, and most 

preferably at least 99% as compared with the baseline response in the absence of the inverse 

agonist. 

KNOWN RECEPTOR shall mean an endogenous receptor for which the 

1 5 endogenous ligand specific for that receptor has been identified. 

LIGAND shalf mean a molecule specific for a naturally occurring receptor. 
MUTANT or MUTATION in reference to an endogenous receptor's nucleic acid 
and/or amino acid sequence shall mean a specified change or changes to such endogenous 
sequences such that a mutated form of an endogenous, non-constitutively activated receptor 

20 evidences constitutive activation of the receptor. In teims of equivalents to specific 
sequences, a subsequent mutated form of a human receptor is considered to be equivalent to 
a first mutation of the human receptor if (a) the level of constitutive activation of the 
subsequent mutated form of a human receptor is substantially the same as that evidenced by 
the first mutation of the receptor; and (b) the percent sequence (amino acid and/or nucleic 

25 acid) homology between the subsequent mutated form of the receptor and the first mutation 
of the receptor is at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 
95%, at least 96%, at least 97%, at least 98%, and most preferably at least 99%. In some 
embodiments, owing to the fact that some preferred cassettes disclosed herein for achieving 
constitutive activation include a single amino acid and/or codon change between the 

30 endogenous and the non-endogenous forms of the GPCR, it is preferred that the percent 
sequence homology should be at least 98%. 



10 



AREN-0309 PATENT 

NON-ORPHAN RECEPTOR shall mean an endogenous naturally occurring 

molecule specific for an identified ligand wherein the binding of a ligand to a receptor 

activates an intracellular signaling pathway. 

ORPHAN RECEPTOR shall mean an endogenous receptor for which the ligand 
5 specific for that receptor has not been identified or is not known. 

PHARMACEUTICAL COMPOSITION shall mean a composition comprising at 
least one active ingredient, whereby the composition is amenable to investigation for a 
specified, efficacious outcome in a mammal (for example, and not limitation, a human). 
Those of ordinary skill in the art will understand and appreciate the techniques appropriate 
10 for detennining whether an active ingredient has a desired efficacious outcome based upon 
the needs of the artisan. 

PLASMID shall mean the combination of a Vector and cDNA. Generally, a 
Plasmid is introduced into a Host Cell for the purposes of replication and/or expression of 
the cDNA as a protein. 

1 5 SECOND MESSENGER shall mean an intracellular response produced as a result 

of receptor activation. A second messenger can include, for example, inositol triphosphate 
(IP3), diacycglycerol (DAG), cyclic AMP (cAMP), and cyclic GMP (cGMP). Second 
messenger response can be measured for a determination of receptor activation. In addition, 
second messenger response can be measured for the direct identification of candidate 

20 compounds, including for example, inverse agonists, agonists, and antagonists. 

SIGNAL TO NOISE RATIO shall mean the signal generated in response to 
activation, amplification, or stimulation wherein the signal is above the background noise or 
the basal level in response to non-activation, non-amplification, or non-stimulation. 

SPACER shall mean a translated number of amino acids that are located after the 

25 last codon or last amino acid of a gene, for example a GPCR of interest, but before the start 
codon or beginning regions of the G protein of interest, wherein the translated number 
amino acids are placed in-frame with the beginnings regions of the G protein of interest. 
The number of translated amino acids can be tailored according to the needs of the skilled 
artisan and is generally from about one amino acid, preferably two amino acids, more 

30 preferably three amino acids, more preferably four amino acids, more preferably five amino 
acids, more preferably six amino acids, more preferably seven amino acids, more preferably 
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eight amino acids, more preferably, nine amino acids, more preferably ten amino 

acids, more preferably eleven amino acids, and even more preferably twelve amino acids. 

STIMULATE or STIMULATING, in relationship to the term "response" shall 
mean that a response is increased in the presence of a compound as opposed to in the 
5 absence of the compound. 

SUBSTANTIALLY shall refer to a result which is within 40% of a control result, 
preferably within 35%, more preferably within 30%, more preferably within 25%, more 
preferably within 20%, more preferably within 15%, more preferably within 10%, more 
preferably within 5%, more preferably within 2%, and most preferably within 1% of a 
10 control result. For example, in the context of receptor functionality, a test receptor may 
exhibit substantially similar results to a control receptor if the transduced signal, measured 
using a method taught herein or similar method known to the art-skilled, if within 40% of 
the signal produced by a control signal. 

VECTOR in reference to cDNA shall mean a circular DNA capable of 
1 5 incorporating at least one cDNA and capable of incorporation into a Host Cell. 

The order of the following sections is set forth for presentational efficiency and is 
not intended, nor should be construed, as a limitation on the disclosure or the claims to 
follow. 

20 A* Introduction 

The traditional study of receptors has typically proceeded from the a priori 
assumption (historically based) that the endogenous ligand must first be identified before 
discovery could proceed to find antagonists and other molecules that could affect the 
receptor. Even in cases where an antagonist might have been known first, the search 

25 immediately extended to looking for the endogenous ligand This mode of thinking has 
persisted in receptor research even after the discovery of constitutively activated receptors. 
What has not been heretofore recognized is that it is the active state of the receptor that is 
most useful for discovering agonists and inverse agonists of the receptor. For those diseases 
which result from an overly active receptor or an under-active receptor, what is desired in a 

30 therapeutic drug is a compound which acts to diminish the active state of a receptor or 
enhance the activity of the receptor, respectively, not necessarily a drug which is an 
antagonist to the endogenous ligand. This is because a compound that reduces or enhances 
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the activity of the active receptor state need not bind at the same site as the endogenous 

ligand Thus, as taught by a method of this invention, any search for therapeutic 
compounds should start by screening compounds against the ligand-independent active 
state. 

5 B. Identification of Human GPCRs 

The efforts of the Human Genome project has led to the identification of a plethora 
of information regarding nucleic acid sequences located within the human genome; it has 
been the case in this endeavor that genetic sequence information has been made available 
without an understanding or recognition as to whether or not any particular genomic 

10 sequence does or may contain open-reading frame information that translate human 
proteins. Several methods of identifying nucleic acid sequences within the human genome 
are within the purview of those having ordinary skill in the art. For example, and not 
limitation, a variety of human GPCRs, disclosed herein, were discovered by reviewing the 
GenBank™ database. Table B, below, lists several endogenous GPCRs that we have 

1 5 discovered, along with other GPCRs that are homologous to the disclosed GPCR. 



TABLE B 



Disclosed 


Accession 


Open Reading 


Reference To 


Per Cent 


Human 


Number 


Frame 


Homologous 


Homology 


Orphan GPCRs 


Identified 


(Base Pairs) 


GPCR 


To Designated 










GPCR 


hRUP28 


AC073957 


l,002bp 


hGPR30 


34% 


hRUP29 


AC083865 


918bp 


hGPR18 


27% 


hRUP30 


AC055863 


l,125bp 


hBRBl 


27% 


MHJP31 


AL356214 


l,086bp 


hGALR-1 


31% 


hRUP32 


AL513524 


l,038bp 


hPNR 


43% 


hRUP33 


AL5 13524 


l,020bp 


GPR57 


50% 








GPR58 


51% 


hRUl'34 


AL5 13524 


l,029bp 


hPNR 


45% 


hRUP35 


AC021089 


l,062bp 


hK-type 3 


27% 








opioid 




hRUP36 


AC090099 


969bp 


GPR90 


42% 
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hRUP37 


AC090099 


969bp 


hMRG 


41% 



Receptor homology is useful in terms of gaining an appreciation of a role of the 
receptors within the human body. As the patent document progresses, techniques for 
mutating these receptors to establish non-endogenous, constitutively activated versions of 
5 these receptors will be discussed. 

The techniques disclosed herein are also applicable to other human GPCRs known 
to the art, as will be apparent to those skilled in the art. 
C. Receptor Screening 

Screening candidate compounds against a non-endogenous, constitutively activated 

10 version of the GPCRs disclosed herein allows for the direct identification of candidate 
compounds which act at the cell surface receptor, without requiring use of the receptor's 
endogenous ligand. Using routine, and often commercially available techniques, one can 
determine areas within the body where the endogenous version of human GPCRs disclosed 
herein is expressed and/or over-expressed. The expression location of a receptor in a 

15 specific tissue provides a scientist with the ability to assign a physiological functional role 
of the receptor. It is also possible using these techniques to determine related 
disease/disorder states which are associated with the expression and/or over-expression of 
the receptor, such an approach is disclosed in this patent document. Furthermore, 
expression of a receptor in diseased organs can assist one in determining the magnitude of 

20 the clinical relevance of the receptor. 

Constitutive activation of the GPCRs disclosed herein is based upon the distance 
from the proline residue at which is presumed to be located within TM6 of the GPCR; this 
algorithmic technique is disclosed in co-pending and commonly assigned patent document 
PCT Application Number PCT/US99/23938, published as WO 00/22129 on April 20, 2000, 

25 which, along with the other patent documents listed herein, is incorporated herein by 
reference. The algorithmic technique is not predicated upon traditional sequence 
"alignment" but rather a specified distance from the aforementioned TM6 proline residue 
(or, of course, endogenous constitutive substitution for such proline residue). By mutating 
the amino acid residue located 16 amino acid residues from this residue (presumably 

30 located in the IC3 region of the receptor) to, most preferably, a lysine residue, constitutive 
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activation of the receptor may be obtained. Other amino acid residues may be useful in 

the mutation at this position to achieve this objective and will be discussed in detail, below. 
D. Disease/Disorder Identification and/or Selection 

As will be set forth in greater detail below, inverse agonists and agonists to the non- 
5 endogenous, constitutively activated GPCR can be identified by the methodologies of this 
invention. Such inverse agonists and agonists are ideal candidates as lead compounds in 
drug discovery programs for treating diseases related to this receptor. Because of the ability 
to directly identify inverse agonists to the GPCR, thereby allowing for the development of 
pharmaceutical compositions, a search for diseases and disorders associated with the GPCR 

10 is relevant. The expression location of a receptor in a specific tissue provides a scientist 
with the ability to assign a physiological function to the receptor. For example, scanning 
both diseased and normal tissue samples for the presence of the GPCR now becomes more 
than an academic exercise or one which might be pursued along the path of identifying an 
endogenous ligand to the specific GPCR. Tissue scans can be conducted across a broad 

15 range of healthy and diseased tissues. Such tissue scans provide a potential first step in 
associating a specific receptor with a disease and/or disorder. Furthermore, expression of a 
receptor in diseased organs can assist one in determining the magnitude of clinical 
relevance of the receptor. 

The DNA sequence of the GPCR can be used to make a probe/primer. In some 

20 preferred embodiments the DNA sequence is used to make a probe for (a) dot-blot analysis 
against tissue-mRNA, and/or (b) RT-PCR identification of the expression of the receptor in 
tissue samples. The presence of a receptor in a tissue source, or a diseased tissue, or the 
presence of the receptor at elevated concentrations in diseased tissue compared to a normal 
tissue, can be used to correlate location to function and indicate the receptor's physiological 

25 role/function and create a treatment regimen, including but not limited to, a disease 
associated with that function/role. Receptors can also be localized to regions of organs by 
this technique. Based on the known or assumed roles/functions of the specific tissues to 
which the receptor is localized, the putative physiological function of the receptor can be 
deduced. For example and not limitation, proteins located/expressed in areas of the 

30 thalamus are associated with sensorimotor processing and arousal (see, Goodman & 
Gilman's, The Pharmacological Basis of Therapeutics , 9 th Edition, page 465 (1996)). 
Proteins expressed in the hippocampus or in Schwann cells are associated with learning and 
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memory, and myelination of peripheral nerves, respectively {see, Kandel, E. et aL, 

Essentials of Neural Science and Behavior pages 657, 680 and 28, respectively (1995)). 

E. Screening of Candidate Compounds 
5 1. Generic GPCR screening assay techniques 

When a G protein receptor becomes constitutively active, it binds to a G protein 
(e.g., G q , G s , Gi, G z , Go) and stimulates the binding of GTP to the G protein. The G protein 
then acts as a GTPase and hydrolyzes the GTP to GDP, whereby the receptor, under normal 
conditions, becomes deactivated. However, constitutively activated receptors continue to 

10 exchange GDP to GTP. A non-hydrolyzable analog of GTP, [ 35 S]GTPyS, can be. used to 
monitor enhanced binding to membranes which express constitutively activated receptors. 
It is reported that [ 35 S]GTPyS can be used to monitor G protein coupling to membranes in 
the absence and presence of ligand. An example of this monitoring, among other examples 
well-known and available to those in the art, was reported by Traynor and Nahorski in 

15 1995. The use of this assay system is typically for initial screening of candidate compounds 
because the system is generically applicable to all G protein-coupled receptors regardless of 
the particular G protein that interacts with the intracellular domain of the receptor. 
2. Specific GPCR screening assay techniques 

Once candidate compounds are identified using the "generic" G protein-coupled 
20 receptor assay (z.e., an assay to select compounds that are agonists or inverse agonists), 
further screening to confirm that the compounds have interacted at the receptor site is 
preferred. For example, a compound identified by the "generic" assay may not bind to the 
receptor, but may instead merely "uncouple" the G protein from the intracellular domain. 

(u Gs, G t and (?,-. 

25 G s stimulates the enzyme adenylyl cyclase. Gj (and G z and Go), on the other hand, 

inhibits adenylyl cyclase. Adenylyl cyclase catalyzes the conversion of ATP to cAMP; 
thus, constitutively activated GPCRs that couple the G s protein are associated with 
increased cellular levels of cAMP. On the other hand, constitutively activated GPCRs that 
couple Gi (or G z , Go) protein are associated with decreased cellular levels of cAMP. See, 

30 generally, 'Indirect Mechanisms of Synaptic Transmission," Chpt 8, From Neuron To 
Brain (3 rd Ed.) Nichols, J.G. et al eds. Sinauer Associates, Inc. (1992). Thus, assays that 
detect cAMP can be utilized to determine if a candidate compound is, e.g, an inverse 
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agonist to the receptor (i.e., such a compound would decrease the levels of cAMP). A 

variety of approaches known in the art for measuring cAMP can be utilized; a most 
preferred approach relies upon the use of anti-cAMP antibodies in an ELISA-based format. 
Another type of assay that can be utilized is a whole cell second messenger reporter system 
5 assay. Promoters on genes drive the expression of the proteins that a particular gene 
encodes. Cyclic AMP drives gene expression by promoting the binding of a cAMP- 
responsive DNA binding protein or transcription factor (CREB) that then binds to the 
promoter at specific sites (cAMP response elements) and drives the expression of the gene. 
Reporter systems can be constructed which have a promoter containing multiple cAMP 

10 response elements before the reporter gene, e.g., P-galactosidase or luciferase. Thus, a 
constitutively activated G s -linked receptor causes the accumulation of cAMP that then 
activates the gene and leads to the expression of the reporter protein. The reporter protein 
such as P-galactosidase or luciferase can thai be detected using standard biochemical 
assays (Chen et al. 1995). 

15 b. G 0 andG T 

G q and G 0 are associated with activation of the enzyme phospholipase C, which in 
turn hydrolyzes the phospholipid PP 2 , releasing two intracellular messengers: 
diacycloglycerol (DAG) and inositol 1,4,5-triphoisphate (D? 3 ). Increased accumulation of 

20 IP 3 is associated with activation of G q - and Go-associated receptors. See, generally, 
'Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3 rd Ed.) 
Nichols, J.G. et al eds. Sinauer Associates, Inc. (1992). Assays that detect IP3 accumulation 
can be utilized to determine if a candidate compound is, e.g., an inverse agonist to a G q - or 
Go-associated receptor (ie., such a compound would decrease the levels of IP 3 ). G q - 

25 associated receptors can also be examined using an API reporter assay wherein G q - 
dependent phospholipase C causes activation of genes containing API elements; thus, 
activated G q -associated receptors will evidence an increase in the expression of such genes, 
whereby inverse agonists thereto will evidence a decrease in such expression, and agonists 
will evidence an increase in such expression. Commercially available assays for such 

30 detection are available. 

3. GPCR Fusion Protein 
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The use of an endogenous, constitutively activated GPCR or a 

non-endogenous, constitutively activated GPCR, for use in screening of candidate 
compounds for the direct identification of inverse agonists, agonists provide an interesting 
screening challenge in that, by definition, the receptor is active even in the absence of an 
5 endogenous ligand bound thereto. Thus, in order to differentiate between, e.g., the non- 
endogenous receptor in the presence of a candidate compound and the non-endogenous 
receptor in the absence of that compound, with an aim of such a differentiation to allow for 
an understanding as to whether such compound may be an inverse agonist or agonist or 
have no affect on such a receptor, it is preferred that an approach be utilized that can 

1 0 enhance such differentiation. A preferred approach is the use of a GPCR Fusion Protein. 

Generally, once it is determined that a non-endogenous GPCR has been 
constitutively activated using the assay techniques set forth above (as well as others), it is 
possible to determine the predominant G protein that couples with the endogenous GPCR. 
Coupling of the G protein to the GPCR provides a signaling pathway that can be assessed. 

15 In some embodiments it is preferred that screening take place using a m amm alian 
expression system, such a system will be expected to have endogenous G protein therein. 
Thus, by definition, in such a system, the non-endogenous, constitutively activated GPCR 
will continuously signal. In some embodiments it is preferred that this signal be enhanced 
such that in the presence of, an inverse agonist to the receptor, it is more likely that it 

20 will be able to more readily differentiate, particularly in the context of screening, between 
the receptor when it is contacted with the inverse agonist. 

The GPCR Fusion Protein is intended to enhance the efficacy of G protein coupling 
with the non-endogenous GPCR. The GPCR Fusion Protein is preferred for screening with 
either an endogenous, constitutively active GPCR or a non-endogenous, constitutively 

25 activated GPCR because such an approach increases the signal that is utilized in such 
screening techniques. This is important in facilitating a significant "signal to noise'* ratio; 
such a significant ratio is preferred for the screening of candidate compounds as disclosed 
herein. 

The construction of a construct useful for expression of a GPCR Fusion Protein is 
30 within the purview of those having ordinary skill in the art. Commercially available 
expression vectors and systems offer a variety of approaches that can fit the particular needs 
of an investigator. Important criteria on the construction of such a GPCR Fusion Protein 
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construct include but are not limited to, that the endogenous GPCR sequence and Hie G 

protein sequence both be in-frame (preferably, the sequence for the endogenous GPCR is 
upstream of the G protein sequence), and that the "stop" codon of the GPCR be deleted or 
replaced such that upon expression of the GPCR, the G protein can also be expressed. 
5 Other embodiments include constructs wherein the endogenous GPCR sequence and the G 
protein sequence are not in-frame and/or the "stop" codon is not deleted or replaced. The 
GPCR can be linked directly to the G protein, or there can be spacer residues between the 
two (preferably, no more than about 12, although this number can be readily ascertained by 
one of ordinary skill in the art). Based upon convenience it is preferred to use a spacer. 

10 Preferably, the G protein that couples to the non-endogenous GPCR will have been 
identified prior to the creation of the GPCR Fusion Protein construct. Because there are 
only a few G proteins that have been identified, it is preferred that a construct comprising 
the sequence of the G protein (i.e., a universal G protein construct (see Examples)) be 
available for insertion of an endogenous GPCR sequence therein; this provides for further 

15 efficiency in the context of large-scale screening of a variety of different endogenous 
GPCRs having different sequences. 

As noted above, constitutively activated GPCRs that couple to Q, G z and G G are 
expected to inhibit the formation of cAMP making assays based upon these types of GPCRs 
challenging (z.e., the cAMP signal decreases upon activation thus making the direct 

20 identification of, e.g., inverse agonists (which would further decrease this signal), 
challenging. As will be disclosed herein, we have ascertained that for these types of 
receptors, it is possible to create a GPCR Fusion Protein that is not based upon the GPCRs 
endogenous G protein, in an effort to establish a viable cyclase-based assay. Thus, for 
example, an endogenous G\ coupled receptor can be fused to a G s protein -such a fusion 

25 construct, upon expression, "drives" or "forces" the endogenous GPCR to couple with, e.g., 
G s rather than the "natural" Q protein, such that a cyclase-based assay can be established. 
Thus, for Q, G 2 and G 0 coupled receptors, in some embodiments it is preferred that when a 
GPCR Fusion Protein is used and the assay is based upon detection of adenylyl cyclase 
activity, that the fusion construct be established with G s (or an equivalent G protein that 

30 stimulates the formation of the enzyme adenylyl cyclase). 



G 


Effect of cAMP 


Effect of EP 3 


Effect of 


Effect on IP 3 
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activation or 
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agonist DinQingj 


/iff AtiiCi4 Ki?if1innr\ 

agonist Dinaingj 


Inverse 










/vgomst 




G s 


Increase 


N/A 


Decrease 


N/A 


Q 


Decrease 


N/A 


Increase 


N/A 


G z 


Decrease 


N/A 


Increase 


N/A 


Go 


Decrease 


Increase 


Increase 


Decrease 


G q 


N/A 


Increase 


N/A 


Decrease 



Equally effective is a G Protein Fusion construct that utilizes a G q Protein fused 
with a G s , Q, G z or G 0 Protein. In some embodiments a preferred fusion construct can be 
accomplished with a G q Protein wherein the first six (6) amino acids of the G-protein a- 
5 subunit ("Gaq") is deleted and the last five (5) amino acids at the C-terminal end of Gccq is 
replaced with the corresponding amino acids of the Got of the G protein of interest. For 
example, a fusion construct can have a G q (6 amino acid deletion) fused with a Gj Protein, 
resulting in a "Gq/Gj Fusion Construct". This fusion construct will forces the endogenous 
Gi coupled receptor to couple to its non-endogenous G protein, G q , such that the second 
10 messenger, for example, inositol triphosphate or diacylgycerol, can be measured in lieu of 
cAMP production. 

4. Co-transfection of a Target G\ Coupled GPCR with a Signal- 
Enhancer G s Coupled GPCR (cAMP Based Assays) 

15 A Q coupled receptor is known to inhibit adenylyl cyclase, and, therefore, decreases 

the level of cAMP production, which can make assessment of cAMP levels challenging. An 
effective technique in measuring the decrease in production of cAMP as an indication of 
constitutive activation of a receptor that predominantly couples Gi upon activation can be 
accomplished by co-transfecting a signal enhancer, e.g., a non-endogenous, constitutively 

20 activated receptor that predominantly couples with G s upon activation (e.g., TSHR-A623I, 
disclosed below), with the Gi linked GPCR. As is apparent, constitutive activation of a G s 
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coupled receptor can be determined based upon an increase in production of cAMP. 

Constitutive activation of a Gi coupled receptor leads to a decrease in production cAMP. 
Thus, the co-transfection approach is intended to advantageously exploit these "opposite" 
affects. For example, co-transfection of a non-endogenous, constitutively activated G s 
5 coupled receptor (the "signal enhanced *) with the endogenous Gj coupled receptor (the 
"target receptor") provides a baseline cAMP signal (ie., although the G\ coupled receptor 
will decrease cAMP levels, this "decrease" will be relative to the substantial increase in 
cAMP levels established by constitutively activated G s coupled signal enhancer). By then 
co-transfecting the signal enhancer with a constitutively activated version of the target 

10 receptor, cAMP would be expected to further decrease (relative to base line) due to the 
increased functional activity of the Gi target (/.e., which decreases cAMP). 

Screening of candidate compounds using a cAMP based assay can then be 
accomplished, with two 'changes 1 relative to the use of the endogenous receptor/G-protein 
fusion: first, relative to the G { coupled target receptor, "opposite" effects will result, z.e., an 

15 inverse agonist of the Gi coupled target receptor will increase the measured cAMP signal, 
while an agonist of the G, coupled target receptor will decrease this signal; second, as would 
be apparent, candidate compounds that are directly identified using this approach should be 
assessed independently to ensure that these do not target the signal enhancing receptor (this 
can be done prior to or after screening against the co-transfected receptors). 

20 

F. Medicinal Chemistry 

Generally, but not always, direct identification of candidate compounds is 
conducted in conjunction with compounds generated via combinatorial chemistry 
techniques, whereby thousands of compounds are randomly prepared for such analysis. 
25 Generally, the results of such screening will be compounds having unique core 
structures; thereafter, these compounds may be subjected to additional chemical 
modification around a preferred core structure(s) to further enhance the medicinal 
properties thereof. Such techniques are known to those in the art and will not be 
addressed in detail in this patent document. 

30 

G. Pharmaceutical compositions 
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Candidate compounds selected for further development can be formulated into 

pharmaceutical compositions using techniques well known to those in the art. Suitable 
pharmaceutically-acceptable carriers are available to those in the art; for example, see 
Remington's Pharmaceutical Sciences, 16 th Edition, 1980, Mack Publishing Co., (Osol et 
5 al., eds.). 

H. Other Utilities 

Although a preferred use of the non-endogenous versions of the GPCRs disclosed 
herein may be for the direct identification of candidate compounds as inverse agonists or 
agonists (preferably for use as pharmaceutical agents), other uses of these versions of 

10 GPCRs exist. For example, in vitro and in vivo systems incorporating GPCRs can be 
utilized to further elucidate and understand the roles these receptors play in the human 
condition, both normal and diseased, as well as understanding the role of constitutive 
activation as it applies to understanding the signaling cascade. In some embodiments it is 
preferred that the endogenous receptors be "orphan receptors", the endogenous ligand 

15 for the receptor has not been identified. In some embodiments, therefore, the modified, 
non-endogenous GPCRs can be used to understand the role of endogenous receptors in the 
human body before the endogenous ligand therefore is identified. Such receptors can also 
be used to further elucidate known receptors and the pathways through which they 
transduce a signal. Other uses of the disclosed receptors will become apparent to those in 

20 the art based upon, inter alia, a review of this patent document. 

EXAMPLES 

The following examples are presented for purposes of elucidation, and not 
limitation, of the present invention. While specific nucleic acid and amino acid sequences 
are disclosed herein, those of ordinary skill in the art are credited with the ability to make 

25 minor modifications to these sequences while achieving the same or substantially similar 
results reported below. The traditional approach to application or understanding of 
sequence cassettes from one sequence to another (e.g. from rat receptor to human receptor 
or from human receptor A to human receptor B) is generally predicated upon sequence 
alignment techniques whereby the sequences are aligned in an effort to determine areas of 

30 commonality. The mutational approach disclosed herein does not rely upon this approach 
but is instead based upon an algorithmic approach and a positional distance from a 
conserved proline residue located within the TM6 region of human GPCRs. Once this 
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approach is secured, those in the art are credited with the ability to make minor 

modifications thereto to achieve substantially the same results (i.e., constitutive activation) 
disclosed herein. Such modified approaches are considered within the purview of this 
disclosure. 
5 Example 1 

Endogenous Human Gpcrs 

1 . Identification of Human GPCRs 
The disclosed endogenous human GPCRs were identified based upon a review of 
the GenBank™ database information. While searching the database, the following 
1 0 cDNA clones were identified as evidenced below (Table Q. 

TABLE C 



Disclosed 
Human 
Orphan 
GPCRs 


Accession 
Number 
Identified 


Open 
Reading 
Frame 
(Base Pairs) 


Reference To 
Homologous 
GPCR 


Nucleic Acid 
SEQJD. NO. 


Amino Acid 
SEQJDJNO. 


hRUP28 


AC073957 


l,002bp 


b.GPR30 


1 


2 


hRUP29 


AC083865 


918bp 


hGPR18 


3 


4 


hRUP30 


AC055863 


l,125bp 


hBRBl 


5 


6 


hRUP31 


AL356214 


l,086bp 


b.GALR-1 


7 


8 


hRlJP32 


AL513524 


l,038bp 


hPNR 


9 


10 [ 


hRUP33 


AL5 13524 


l,020bp 


GPR57 
GPR58 


11 


12 


hRUP34 


AL5 13524 


l,029bp 


hPNR 


13 


14 


hJRUP35 


AC021089 


l,062bp 


hK-type 3 
opioid 


15 


16 


hRUP36 


AC090099 


969bp 


GPR90 


17 


18 


hRUP37 


AC090099 


969bp 


hMRG 


19 


20 



2. Full Length Cloning 

a- hRUP28(Seq.Id.Nos.l&2) 

15 The disclosed human RUP28 was identified based upon the use of GenBank database 

information. While searching the database, a cDNA clone with Accession Number 

AC073957 was identified as a human genomic sequence from chromosome 7. 

The full length RUP28 was cloned by PCR using primers: 
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5'- CAGAGCTCTGGTGGCCACCTCTGTCC- 

3' (SEQ.ID.N0.:21; sense, 5' of initiation codon), 

5 , -CTGCGTCCACCAGAGTCACGTCTCC-3 , (SEQ.ID.NO.:22; antisense, 3' of stop 
codon), and human adult liver Marathon-Ready™ cDNA (Clontech) as template. 
5 Advantage™ cDNA polymerase (Clontech) was used for the amplification in a 50^1 
reaction by the following cycle with step 2 to 4 repeated 35 times: 95°C for 5 min; 94°C 
for 30 sec; 58°C for 30 sec; 72°C for 1 min 30 sec; and 72°C for 7 min. 

A L16kb PCR fragment was isolated from a 1% agarose gel and cloned into the 
pCRH-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 
10 (P.E. Biosystems). See, SEQ.ID.NO.:l for the nucleic acid sequence and SEQ.ID.NO.:2 
for the putative amino acid sequence. 

b. hRUP29(Seq.Id.Nos.3&4) 
The disclosed human RUP29 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
15 Number AC0083865 was identified as a human genomic sequence from chromosome 7. 
The full length RUP29 was cloned by PCR using primers: 
5 ' -GTATGCCTGGCCAC AATACCTCCAGG-3 9 (SEQ.ID.NO.:23; sense, containing the 
initiation codon), 

5 '-GTTTGTGGCTAACGGCAC AAAACACAATTCC-3 ' (SEQ.ID.NO.:24; antisense, 
20 containing the stop codon) and human genomic DNA as template. TaqPlus® Precision 

DNA polymerase (Stratagene) was used for the amplification in a 50pl reaction by the 

following cycle with step 2 to 4 repeated 35 times: 94°C for 5 min; 94°C for 30 sec; 54°C 

for 30 sec; 72°C for 1 min 30 sec; and 72°C for 7 min. 

A 930bp PCR fragment was isolated from a 1% agarose gel and cloned into the 
25 pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 

(P.E. Biosystems). 

Rapid amplification of cDNA ends (RACE) was performed using human 
leukocyte and ovary Marathon-Ready™ cDNA (Clontech) to determine the precise 5* 
end of RUP29 cDNA. RUP29 specific primer (1) having the sequence: 
30 5-GGTACCACAATGACAATCACCAGCGTCC-3 , (SEQ.K>.NO.:25) 

and API primer (Clontech) were used for the first-round PCR reaction, and RUP29 
specific primer (2) having the following sequence: 
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5'- 

GGAACGTGAGGTACATGTGGATGTGCAGC-3 ' (SEQ.ID.NO. :26) 
and AP2 primer (Clontech) were used for the second-round PGR reaction. The products 
of the RACE reactions were isolated and cloned into the pCRQ-TOPO vector 
5 (Invitrogen) and sequenced. See, SEQ.ID.NO. :3 for the nucleic acid sequence and 
SEQ.ID.NO. :4 for the putative amino acid sequence, 
c. hRUP30 (Seq. Id. Nos. 5 & 6) 
The disclosed human RUP30 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
10 Number AC055863 was identified as a human genomic sequence from chromosome 17. 

The full length RUP30 was cloned by 5 'RACE -PCR with a human pancreas 
Marathon-Ready™ cDNA (Clontech) as template and the following oligonucleotide: 
5 '-GCAGTGTAGCGGTCAACCGTGAGCAGG-3 '(SEQ.ID.NO. :27; sense, containing the 
initiation codon), and API primer (Clontech) were used for the first round of RT-PCR and 
1 5 oligonucleotide: 

5 '-TGAGCAGGATGGCGATCCAGACTGAGGCGTGG-3 '(SEQ.ID.NO. :28; antisense, 
containing the stop codon) and AP2 primer (Clontech) were used for the second round of 
PCR. DNA fragments generated by the 5' RACE-PCR were cloned into the pCRII-TOPO 
vector (Invitrogen) and sequenced using the SP6/T7 primers (Stratagene). 
20 Based on the sequence of the 5' RACE products, the full length RUP30 was cloned 

by RT-PCR, using primers: 

5 '-GAGGTACAGCTGGCGATGCTGACAG-3 9 (SEQ.ID.NO.:29; sense, ATG as the 
initiation codon); 

5'-GTGGCCATGAGCCACCCTGAGCTCC-3' (SEQ.ID.NO.:30; antisense, 3' of the stop 
25 codon) and human pancreas Marathon-Ready™ cDNA (Clontech) as template. Taq DNA 

polymerase (Stratagene) was used for the amplification in 50 pi reaction by the following 

cycle with step 2 to step 4 repeated 35 times: 94°C for 40 seconds; 94°C for 20 seconds; 

64°C for 20 seconds; 72°C for 2 minutes; and 72°C for 5 minutes. 

A 1.2 Kb PCR fragment was isolated from a 1% agarose gel and cloned into the 
30 pCRH-TOPO vector (Invitrogen) and several clones were sequenced using the ABI Big 

Dye Terminator kit (P.E. Biosystems). See, SEQ.ID.NO. :5 for the nucleic acid sequence 

and SEQ.ID.NO. :6 for the putative amino acid sequence. 

25 



AREN-0309 PATENT 

d. hRUP31 (Seq. Id. Nos. 7 & 8) 

The disclosed human RUP31 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
Number AL356214 was identified as a human genomic sequence from chromosome 10. 
5 The full length RUP3 1 was cloned by RT-PCR using primers: 

5 ' -GGAATGTCC ACTGAATGCGCGCGG-3 * (SEQJQD.NO.:31; sense, containing the 
initiation codon), 

5 '-AGCTCGCC AGGTGTGAGAAACTCGG-3 * (SEQ.ID.NO.:32; antisense, 3' of stop 

codon) and human mammary gland Marathon-Ready™ cDNA (Clontech) as template. 
10 Advantage™ cDNA polymerase (Clontech) was used for the amplification in 50 pi reaction 

by the following cycle with step 2 to step 4 repeated 35 times: 94°C for 40 sec; 94°C for 20 

sec; 66°C for 20 sec; 72°C for 1 min 30 sec; and 72°C for 5 min. 

A 1.1 kb PCR fragment was isolated from a 1% agarose gel and cloned into the 

pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 
15 (P.E. Biosystems). See, SEQ.ID.NO.:7 for the nucleic acid sequence and SEQ.ID.NO.:8 

for the putative amino acid sequence. 

e. hRUP32(Seq.Id.Nos.9&10) 

The disclosed human RUP32 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
20 Number AL5 13524 was identified as a human genomic sequence from chromosome 6. 

The full length RUP32 was cloned by PCR using primers: 
5 '-GCGTTATGAGCAGCAATTCATCCCTGCTGG-3 ' (SEQ.ID.NO.:33; sense, 
containing the initiation codon), 

5 '-GTATCCTGAACTTCGTCTATACAACTGC-3 ' (SEQ.ID.NO.:34; antisense) 
25 and human genomic DNA (Clontech) as template. TaqPlus® Precision DNA polymerase 
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4 
repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 58°C for 20 sec; 72°C for 1 min 30 sec; 
and 72T for 7 min. 

A 1.06 kb PCR fragment was isolated from a 1% agarose gel and cloned into the 
30 pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 
(P.E. Biosystems). See, SEQ.ID.NO.:9 for the nucleic acid sequence and 
SEQ.ID.NO.: 10 for the putative amino acid sequence. 
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f. hRUP33 (Seq. Id. Nos. 11 & 12) 

The disclosed human RUP33 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
Number AL513524 was identified as a human genomic sequence from chromosome 6. 
5 The full length RTJP33 was cloned by PCR using primers: 

5^CCCTCAGGAATGATGCCCTTTTGCCACAA-3' (SEQ.ID.NO.:35; sense, containing 
the initiation codon), 

5^ATCCATGTGGTTGGTGCATGTGGTTCGT-3' (SEQ.TD.NO.:36; antisense) 
and human genomic DNA (Clontech) as template. TaqPlus® Precision DNA polymerase 
10 (Stratagene) was used for the amplification by the following cycle with step 2 to step 4 
repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 56°C for 20 sec; 72°C for 1 min 30 sec; 
and72°Cfor7min. 

A 1.1 kb PCR fragment was isolated from a 1% agarose gel and cloned into the 
pCRH-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 
15 (P.E. Biosystems). See, SEQ.ID.NO.:ll for the nucleic acid sequence and 
SEQ.ID.NO.: 12 for the putative amino acid sequence. 

g. hRUP34(Seq.Id.Nos.l3&14) 

The disclosed human RUP34 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
20 Number AL5 1 3 524 was identified as a human genomic sequence from chromosome 6. 

The full length RUP34 was cloned by PCR using primers: 
5 '-AAACAACAAACAGCAGAACCATGACCAGC-3 ' (SEQ.ID.NO.:37; sense, 
containing the initiation codon), 

5'-ACATAGAGACAAGTGACATGTGTGAACCAC-3' (SEQ.ID.NO.:38; antisense) 
25 and human genomic DNA (Clontech) as template. TaqPlus® Precision DNA polymerase 
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4 
repeated 35 times: 94°C for 3 min; 94°C for 20 sec; 60°C for 20 sec; 72°C for 1 min 30 sec; 
and 72°C for 7 min. 

A 1.27 kb PCR fragment was isolated from a 1% agarose gel and cloned into the 
30 pCRTJ-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 
(P.E. Biosystems). See, SEQ.ID.N0..13 for the nucleic acid sequence and 
SEQ.ID.NO.: 14 for the putative amino acid sequence. 
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h. hRUP35 (Seq. Id. Nos. 15 & 16) 

The disclosed human RUP35 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC021089 was identified as a human genomic sequence from chromosome 16. 
5 The 5* sequence of RUP35 was determined by 5 f RACE- PCR with a human fetal 

brain Marathon-Ready™ cDNA (Clontech) as template. Oligonucleotide 
5 9 -GGTATGAGACCGTGTGGTACTTGAGC-3 ' (SEQJD.NO.:39; sense) 
and API primer (Clontech) were used for the first round of RT-PCR and oligonucleotide 
5 '-GTGGCAGACAGCGATATACCTGTCAATGG-3 * (SEQHXNO.:40; antisense) 
10 and AP2 primer (Clontech) were used for the second round of PCR. DNA fragments 
generated by the 5' RACE-PCR were cloned into the pCRH-TOPO vector (Invitrogen) and 
sequenced using the SP6/T7 primers (Stratagene). 

Based upon the sequence of the 5 f RACE products, the fiill length RUP35 was 
cloned by RT-PCR, using primers 
15 5'-GCGCTCATGGAGCACACGCACGCCCAC-3' (SEQ.ID.NO.:41; sense, ATG as the 
initiation codon) and 

5 '-GAGGCAGTAGTTGCCACACCTATGG-3 ' (SEQ.ID.NO.:42; antisense, 3' of the stop 
codon) and human brain Marathon-Ready™ cDNA (Clontech) as template. Advantage™ 
cDNA polymerase (Clontech) was used for the amplification in 100 jd reaction by the 

20 following cycle with step 2 to step 4 repeated 45 times: 95°C for 2 min; 95°C for 20 sec; 
60°C for 20 sec; 72°C for 1 min 30 sec; and 72°C for 5 min. 

A 1.0 kb PCR fragment was isolated from a 1% agarose gel and cloned into the 
pCRH-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 
(P.E. Biosystems). See, SEQ.ID.NO.:15 for the nucleic acid sequence and 

25 SEQ.ID.NO. : 1 6 for the putative amino acid sequence. 

i. hRUP36(Seq.Id.Nos.l7&18) 

The disclosed human RUP36 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC090099 was identified as a human genomic sequence from chromosome 11. 
30 The full length RUP36 was cloned by PCR using primers: 

5'- CATCTGGTTTGTGTTCCCAGGGGCACCAG -3' (SEQ.ID.NO.:43; sense, 5' of start 
codon), 

28 



AREN-0309 PATENT 

5'- 

GACAGTGTTGCTCTCAAAGTCCCGTCTGACTG -3' (SEQ.ID.NO.:44; antisense, 3' of 
stop codon) and human genomic DNA (Clontech) as template. TaqPlus® Precision DNA 
polymerase (Stratagene) was used for the amplification in a 50jal reaction by the following 
5 cycle with step 2 to step 4 repeated 30 times: 95°C, 5 min; 95°C for 30 sec; 70°C for 30 sec; 
72°C for 1 min 30 sec; and 72°C for 7 min. 

A 1.0 kb PCR fragment was isolated from a 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit 
(P.E. Biosystems). See, SEQ.ID.NO.:17 for the nucleic acid sequence and 

10 SEQ.ID.NO.:18 for the putative amino acid sequence. 

j. hRUP37(Seq.IcLNos.l9&20) 
The disclosed human RUP37 was identified based upon the use of GenBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC090099 was identified as a human genomic sequence from chromosome 11. 

15 The full length RUP37 was cloned by PCR using primers: . 

5 '-CTGTTTCCAGGGTCATCAGACTGGG-3 9 (SEQ.ID.NO.:45; sense); 
5 ' -GC AGCATTGCTCTCAAAGTCCTGTCTG-3 ' (SEQ.ID.NO.:46; antisense) 
and human genomic DNA (Clontech) as template. TaqPlus® Precision DNA polymerase 
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4 

20 repeated 35 times: 95°C for 5 min; 95°C for 30 sec; 62°C for 30 sec; 72°C for 1 min 30 sec; 
and72°Cfor7min. 

A 969 base pair was isolated from a 1%% agarose gel and cloned into the pCRII- 
TOPO vector (Invitrogen) and sequenced using the ABI Big Dye Terminator Kit (P.E. 
Biosystems). See, SEQ.ID.NO.:19 for the nucleic acid sequence and SEQ.ID.NO.:20 for 
25 the putative amino acid sequence. 
Example 2 

Preparation of Non-Endogenous, Constitutively Activated Gpcrs 

Those skilled in the art are credited with the ability to select techniques for 
mutation of a nucleic acid sequence. Presented below are approaches utilized to create 
30 non-endogenous versions of several of the human GPCRs disclosed above. The 
mutations disclosed below are based upon an algorithmic approach whereby the 16 th 
amino acid (located in the IC3 region of the GPCR) from a conserved proline (or an 
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endogenous, conservative substitution therefore) residue (located in the TM6 

region of the GPCR, near the TM6/IC3 interface) is mutated, preferably to an alanine, 
histimine, arginine or lysine amino acid residue, most preferably to a lysine amino acid 
residue. 

5 1. Transformer Site-Directed™ Mutagenesis 

Preparation of non-endogenous human GPCRs may be accomplished on human 
GPCRs using, inter alia, Transformer Site-Directed™ Mutagenesis Kit (Clontech) 
according to the manufacturer instructions. In some embodiments two mutagenesis primers 
are used, preferably a lysine mutagenesis oligonucleotide that creates the lysine mutation, 
10 and a selection marker oligonucleotide. For convenience, the codon mutation to be 
incorporated into the human GPCR is also noted, in standard form (Table D): 

TABLED 



Receptor Identifier 


Codon Mutation 


hRUP28 


V274K 


hRUP29 


T249K 


hRUP30 


R232K 


hRUP31 


M294K 


hRUP32 


F220K 


hRUP34 


A238K 


hRUP35 


Y215K 


hRUP36 


L294K 


hRUP37 


T219K 



15 Example 3 

Receptor Expression 

Although a variety of cells are available to the art-skilled for the expression of 
proteins, it is preferred that mammalian cells be utilized The primary reason for this is 
20 predicated upon practicalities, i.e., utilization of, e.g., yeast cells for the expression of a 
GPCR, while possible, introduces into the protocol a non-mammalian cell which may not 
(indeed, in the case of yeast, does not) include the receptor-coupling, genetic-mechanism 
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and secretary pathways that have evolved for mammalian systems - thus, results 

obtained in non-mammalian cells, while of potential use, are not as preferred as those 
obtained using mammalian cells. Of the mammalian cells, COS-7, 293 and 293T cells 
are particularly preferred, although the specific mammalian cell utilized can be 
5 predicated upon the particular needs of the artisan. 

a. Transient Transfection 

On day one, 6xl0 6 cells/10 cm dish of 293 cells well were plated out. On day two, 
two reaction tubes were prepared (the proportions to follow for each tube are per plate): 
tube A was prepared by mixing 4|ag DNA (eg., pCMV vector, pCMV vector with receptor 

10 cDNA, etc.) in 0.5 ml serum free DMEM (Gibco BRL); tube B was prepared by mixing 
24|jl lipofectamine (Gibco BRL) in 0.5ml serum free DMEM. Tubes A and B were 
admixed by inversion (several times),, followed by incubation at room temperature for 30- 
45min. The admixture is referred to as the 'transfection mixture". Plated 293 cells were 
washed with 1XPBS, followed by addition of 5 ml serum free DMEM. One ml of the 

15 transfection mixture were added to the cells, followed by incubation for 4hrs at 37°C/5% 
CO2. The transfection mixture was removed by aspiration, followed by the addition of 
10ml ofDMEM/10% Fetal Bovine Serum. Cells were incubated at 37°C/5% C0 2 . After 
48hr incubation, cells were harvested and utilized for analysis. 

b. Stable Cell Lines 

20 Approximately 12xl0 6 293 cells will be plated on a 15cm tissue culture plate, and 

grown in DME High Glucose Medium containing 10% fetal bovine serum and one percent 
sodium pyruvate, L-glutamine, and antibiotics. Twenty-four hours following plating of 293 
cells (to approximately -80% confluency), the cells will be transfected using 12pg of DNA. 
The 12|iig of DNA is combined with 60pl of lipofectamine and 2mL of DME High Glucose 

25 Medium without serum. The medium will be aspirated from the plates and the cells washed 
once with medium without serum. The DNA, lipofectamine, and medium mixture will be 
added to the plate along with lOmL of medium without serum. Following incubation at 
37°C for four to five hours, the medium will be aspirated and 25ml of medium containing 
serum will be added. Twenty-four hours following transfection, the medium will be 

30 aspirated again, and fresh medium with serum will be added. Forty-eight hours following 
transfection, the medium will be aspirated and medium with serum will be added containing 
geneticin (G418 drug) at a final concentration of 500jxg/mL. The transfected cells will then 
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undergo selection for positively transfected cells containing the G418 resistant gene. 

The medium will be replaced every four to five days as selection occurs. During selection, 
cells will be grown to create stable pools, or split for stable clonal selection. 
Example 4 

5 Assays For determination of Constitutive Activity 
of Non-Endogenous GPCRs 

A variety of approaches are available for assessment of constitutive activity of the 
non-endogenous human GPCRs. The following are illustrative; those of ordinary skill in 
10 the art are credited with the ability to determine those techniques that are preferentially 
beneficial for the needs of the artisan. 

1 . Membrane Binding Assays: [ 35 S]GTPyS Assay 

When a G protein-coupled receptor is in its active state, either as a result of ligand 
15 binding or constitutive activation, the receptor couples to a G protein and stimulates the 
release of GDP and subsequent binding of GTP to the G protein. The alpha subunit of the 
G protein-receptor complex acts as a GTPase and slowly hydrolyzes the GTP to GDP, at 
which point the receptor normally is deactivated. Constitutively activated receptors 
continue to exchange GDP for GTP. The non-hydrolyzable GTP analog, [ 35 S]GTPyS, can 
20 be utilized to demonstrate enhanced binding of [ 35 S]GTPyS to membranes expressing 
constitutively activated receptors. Advantages of using [ 35 S]GTPyS binding to measure 
constitutive activation include but are not limited to the following: (a) it is generically 
applicable to all G protein-coupled receptors; (b) it is proximal at the membrane surface 
making it less likely to pick-up molecules which affect the intracellular cascade. 
25 The assay takes advantage of the ability of G protein coupled receptors to stimulate 

35 

[ SJGTPyS binding to membranes expressing the relevant receptors. The assay can, 
therefore, be used in the direct identification method to screen candidate compounds to 
constitutively activated G protein-coupled receptors. The assay is generic and has 
application to drug discovery at all G protein-coupled receptors. 
30 The [ 35 S]GTPyS assay is incubated in 20 mM HEPES and between 1 and about 

20mM MgCb (this amount can be adjusted for optimization of results, although 20mM is 
preferred) pH 7.4, binding buffer with between about 0.3 and about 1.2 nM [ 35 S]GTPyS 
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(this amount can be adjusted for optimization of results, although 1.2 is preferred ) and 

12.5 to 75 jig membrane protein (e.g., 293 cells expressing the G s Fusion Protein; this 
amount can be adjusted for optimization) and 10 pM GDP (this amount can be changed for 
optimization) for 1 hour. Wheatgerm agglutinin beads (25 jjI; Amersham) will then be 
5 added and the mixture incubated for another 30 minutes at room temperature. The tubes 
will be then centrifuged at 1500 x g for 5 minutes at room temperature and then counted in 
a scintillation counter. 

2. Adenylyl Cyclase 
A Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No. SMP004A) 

10 designed for cell-based assays can be modified for use with crude plasma membranes. The 
Flash Plate wells can contain a scintillant coating which also contains a specific antibody 
recognizing cAMP. The cAMP generated in the wells can be quantitated by a direct 
competition for binding of radioactive cAMP tracer to the cAMP antibody. The following 
serves as a brief protocol for the measurement of changes in cAMP levels in whole cells 

1 5 that express the receptors. 

Transfected cells will be harvested approximately twenty four hours after transient 
transfection. Media will be carefully aspirated and discarded. Ten ml of PBS will gently be 
added to each dish of cells followed by careftd aspiration. One ml of Sigma cell 
dissociation buffer and 3ml of PBS will be added to each plate. Cells will be pipetted off 

20 the plate and the cell suspension collected into a 50ml conical centrifuge tube. Cells will be 
centrifuged at room temperature at 1,100 rpm for 5 min. The cell pellet will be carefully re- 
suspended into an appropriate volume of PBS (about 3ml/plate). The cells will be then 
counted using a hemocytometer and additional PBS will be added to give the appropriate 
number of cells (to a final volume of about 50jxl/well). 

25 cAMP standards and Detection Buffer (comprising 1 \iCi of tracer [ I25 I cAMP (50 

|nl] to 11 ml Detection Buffer) will be prepared and maintained in accordance with the 
manufacturer's instructions. Assay Buffer will be prepared fresh for screening and 
contained 50pl of Stimulation Buffer, 3pl of test compound (12pM final assay 
concentration) and 50jj1 cells, Assay Buffer will be stored on ice until utilized. The assay 

30 will be initiated by addition of 50^1 of cAMP standards to appropriate wells followed by 
addition of 50pl of PBSA to wells H-ll and H12. Fifty of Stimulation Buffer will be 
added to all wells. DMSO (or selected candidate compounds) will be added to appropriate 
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wells using a pin tool capable of dispensing 3 pi of compound solution, with a final assay 

concentration of 12pM test compound and lOOpl total assay volume. The cells will then be 
added to the wells and incubated for 60 min at room temperature. One hundred jxl of 
Detection Mix containing tracer cAMP will then be added to the wells. Plates will be 
5 incubated for an additional 2 hours followed by counting in a Wallac McroBeta™ 
scintillation counter. Values of cAMP/well will then be extrapolated from a standard 
cAMP curve which will be contained within each assay plate. 

3. Cell-Based cAMP for Q Coupled Target GPCRs 

10 TSHR is a G s coupled GPCR that causes the accumulation of cAMP upon 

activation. TSHR will be constitutively activated by mutating amino acid residue 623 (i.e., 
changing an alanine residue to an isoleucine residue). A Gi coupled receptor is expected to 
inhibit adenylyl cyclase, and, therefore, decrease the level of cAMP production, which can 
make assessment of cAMP levels challenging. An effective technique for measuring the 

1 5 decrease in production of cAMP as an indication of constitutive activation of a G, coupled 
receptor can be accomplished by co-transfecting, most preferably, non-endogenous, 
constitutively activated TSHR (TSHR-A623I) (or an endogenous, constitutively active G s 
coupled receptor) as a "signal enhancer" with a Gj linked target GPCR to establish a 
baseline level of cAMP. Upon creating a non-endogenous version of the G, coupled 

20 receptor, this non-endogenous version of the target GPCR is then co-transfected with the 
signal enhancer, and it is this material that can be used for screening. This approach will be 
utilized to effectively generate a signal when a cAMP assay is used; this approach is 
preferably used in the direct identification of candidate compounds against Q coupled 
receptors. It is noted that for a Q coupled GPCR, when this approach is used, an inverse 

25 agonist of the target GPCR will increase the cAMP signal and an agonist will decrease the 
cAMP signal. 

On day one, 2x1 0 4 293 cells/well will be plated out. On day two, two reaction tubes 
will be prepared (the proportions to follow for each tube are per plate): tube A will be 
prepared by mixing 2ug DNA of each receptor transfected into the mammalian cells, for a 
30 total of 4ug DNA (eg., pCMV vector; pCMV vector with mutated THSR (TSHR-A623I); 
TSHR-A623I and GPCR, etc.) in 1.2ml serum free DMEM (Irvine Scientific, Irvine, CA); 
tube B will be prepared by mixing 120^1 lipofectamine (Gibco BRL) in 1.2ml serum free 
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DMEM. Tubes A and B will then be admixed by inversion (several times), 

followed by incubation at room temperature for 30-45min The admixture is referred to as 
the "transfection mixture". Plated 293 cells will be washed with 1XPBS, followed by 
addition of IQml serum free DMEM. 2.4ml of the transfection mixture will then be added 
5 to the cells, followed by incubation for 4hrs at 37°C/5% C0 2 . The transfection mixture will 
then be removed by aspiration, followed by the addition of 25ml of DMEM/10% Fetal 
Bovine Serum. Cells will then be incubated at 37°C/5% C0 2 . After 24hr incubation, cells 
will be harvested and utilized for analysis. 

A Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat No. SMP004A) 

10 although designed for cell-based assays, can be modified for use with crude plasma 
membranes depending on the need of the skilled artisan. The Flash Plate wells will contain 
a scintillant coating which also contains a specific antibody recognizing cAMP. The cAMP 
generated in the wells can be quantified by a direct competition for binding of radioactive 
cAMP tracer to the cAMP antibody. The following serves as a brief protocol for the 

1 5 measurement of changes in cAMP levels in whole cells that express the receptors. 

Transfected cells will be harvested approximately twenty four hours after 
transient transfection. Media will be carefully aspirated and discarded. Ten ml of PBS 
will be gently added to each dish of cells followed by careful aspiration. One ml of 
Sigma cell dissociation buffer and 3ml of PBS will be added to each plate. Cells will be 

20 pipetted off the plate and the cell suspension will be collected into a 50ml conical 
centrifuge tube. Cells will be centrifuged at room temperature at 1,100 rpm for 5 min. 
The cell pellet will be carefully re-suspended into an appropriate volume of PBS (about 
3ml/plate). The cells will then be counted using a hemocytometer and additional PBS is 
added to give the appropriate number of cells (to a final volume of about 50|il/well). 

25 cAMP standards and Detection Buffer (comprising 1 ^Ci of tracer [ 125 I cAMP (50 

jxl] to 11 ml Detection Buffer) will be prepared and maintained in accordance with the 
manufacturer's instructions. Assay Buffer should be prepared fresh for screening and 
contained SOjlxI of Stimulation Buffer, 3|il of test compound (12|iM final assay 
concentration) and 50pl cells, Assay Buffer can be stored on ice until utilized. The assay 

30 can be initiated by addition of 50pl of cAMP standards to appropriate wells followed by 
addition of 50jal of PBSA to wells H-l 1 and H12. Fifty jul of Stimulation Buffer will be 
added to all wells'. Selected compounds (e.g., TSH) will be added to appropriate wells 
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using a pin tool capable of dispensing 3^1 of compound solution, with a final assay 

concentration of 12|nM test compound and lOOpl total assay volume. The cells will then be 
added to the wells and incubated for 60 min at room temperature. One hundred jxl of 
Detection Mix containing tracer cAMP will then be added to the wells. Plates will then be 
5 incubated additional 2 hours followed by counting in a Wallac MicroBeta scintillation 
counter. Values of cAMP/well will then be extrapolated from a standard cAMP curve 
which is contained within each assay plate. 

4. Reporter-Based Assays 

a. Cre-Luc Reporter Assay (G s -associated receptors) 

10 293 and 293T cells will be plated-out on 96 well plates at a density of 2 x 10 4 

cells per well and will be transfected using Lipofectamine Reagent (BRL) the following 
day according to manufacturer instructions. A DNA/lipid mixture will be prepared for 
each 6-well transfection as follows: 260ng of plasmid DNA in lOOfil of DMEM are 
gently mixed with 2pl of lipid in lOOjil of DMEM (the 260ng of plasmid DNA consisted 

15 of 200ng of a 8xCRE~Luc reporter plasmid, 50ng of pCMV comprising endogenous 
receptor or non-endogenous receptor or pCMV alone, and lOng of a GPRS expression 
plasmid (GPRS in pcDNA3 (Invitrogen)). The 8XCRE-Luc reporter plasmid is prepared 
as follows: vector SRIF-p-gal will be obtained by cloning the rat somatostatin promoter 
(-71/+51) at BglV-Hindm site in the pPgal-Basic Vector (Clontech). Eight (8) copies of 

20 cAMP response element will be obtained by PCR from an adenovirus template 
AdpCF126CCRE8 {see, 7 Human Gene Therapy 1883 (1996)) and cloned into the SRIF- 
p-gal vector at the Kpn-BglV site, resulting in the 8xCRE-P-gal reporter vector. The 
8xCRE-Luc reporter plasmid will be generated by replacing the beta-galactosidase gene 
in the 8xCRE-P-gal reporter vector with the luciferase gene obtained from the pGL3- 

25 basic vector (Promega) at the HindlH-BamHI site. Following 30 min. incubation at 
room temperature, the DNA/lipid mixture will be diluted with 400 jil of DMEM and 
lOOjil of the diluted mixture will be added to each well. One hundred ^1 of DMEM with 
10% FCS will be added to each well after a 4hr incubation in a cell culture incubator. 
The following day the transfected cells will be changed with 200 jil/well of DMEM with 

30 10% FCS. Eight hours later, the wells will be changed to 100 pi /well of DMEM 

without phenol red, after one wash with PBS. Luciferase activity will be measured the 

next day using the LucLite™ reporter gene assay kit (Packard) following manufacturer's 
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instructions and read on a 1450 MicroBeta™ scintillation and luminescence 

counter (Wallac). 

b. API reporter assay (G q -associated receptors) 

5 A method to detect G q stimulation depends on the known property of G q - 

dependent phospholipase C to cause the activation of genes containing API elements in 
their promoter. A Pathdetect™ AP-1 cis-Reporting System (Stratagene, Catalogue # 
219073) can be utilized following the protocol set forth above with respect to the CREB 
reporter assay, except that the components of the calcium phosphate precipitate were 410 
10 ng pAPl-Luc, 80 ng pCMV-receptor expression plasmid, and 20 ng CMV-SEAP. 

c. Srf-Luc Reporter Assay (G q - associated receptors) 

One method to detect G q stimulation depends on the known property of G q - 
dependent phospholipase C to cause the activation of genes containing serum response 
factors in their promoter. A Pathdetect™ SRF-Luc-Reporting System (Stratagene) can be 

15 utilized to assay for G q coupled activity in, e.g., COS7 cells. Cells are transfected with 
the plasmid components of the system and the indicated expression plasmid encoding 
endogenous or non-endogenous GPCR using a Mammalian Transfection™ Kit 
(Stratagene, Catalogue #200285) according to the manufacturer's instructions. Briefly, 
410 ng SRF-Luc, 80 ng pCMV-receptor expression plasmid and 20 ng CMV-SEAP 

20 (secreted alkaline phosphatase expression plasmid; alkaline phosphatase activity is 
measured in the media of transfected cells to control for variations in transfection 
efficiency between samples) are combined in a calcium phosphate precipitate as per the 
manufacturer's instructions. Half of the precipitate is equally distributed between 3 
wells in a 96-well plate, kept on the cells in a serum free media for 24 hours. The last 5 

25 hours the cells are incubated with lpM Angiotensin, where indicated. Cells are then 
lysed and assayed for luciferase activity using a Luclite™ Kit (Packard, Cat. # 6016911) 
and "Trilux 1450 Microbeta" liquid scintillation and luminescence counter (Wallac) as 
per the manufacturer's instructions. The data can be analyzed using GraphPad Prism™ 
2.0a (GraphPad Software Inc.). 

30 d. Intracellular BP3 Accumulation Assay (G q -associated receptors) 
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On day 1, cells comprising the receptors (endogenous and/or non- 

endogenous) are plated onto 24 well plates, usually lxl 0 5 cells/well (although his number 
can be optimized. On day 2 cells are transfected by firstly mixing 0.25ug DNA in 50 |il 
serum free DMEM/well and 2 pi lipofectamine in 50 |nl serum free DMEM/well. The 
5 solutions are gently mixed and incubated for 15-30 min at room temperature. Cells are then 
washed with 0.5 ml PBS and 400 |utl of serum free media and then mixed with the 
transfection media and added to the cells. The cells are incubated for 3-4 hrs at 
37°C/5%C02 and then the transfection media is removed and replaced with lml/well of 
regular growth media. On day 3 the cells are labeled with 3 H-myo-inositol. Briefly, the 

10 media is removed and the cells are washed with 0.5 ml PBS. Then 0.5 ml inositol- 
free/serum free media (GD3CO BRL) are added/well with 0.25 p.Ci of ^-myo-inositol/ 
well and the cells incubated for 16-18 hrs overnight at 37°C/5%C0 2 . On Day 4 the cells are 
washed with 0.5 ml PBS and 0.45 ml of assay medium is added containing inositol- 
free/serum free media 10 \iM pargyline 10 roM lithium chloride or 0.4 ml of assay medium 

15 and 50 pi of lOx ketanserin (ket) to final concentration of lO^M. The cells are then 
incubated for 30 min at 37°C. The cells are then washed with 0.5 ml PBS and 200 jxl of 
fresh/ice cold stop solution (1M KOH; 18 mM Na-borate; 3.8 mM EDTA) is added to each 
well. The solution is kept on ice for 5-10 min (or until cells are lysed) and then neutralized 
by 200 \il of fresh/ice cold neutralization solution (7.5 % HCL). The lysate is then 

20 transferred into 1.5 ml Eppendorf tubes and 1 ml of chloroform/methanol (1:2) is 
added/tube. The solution is vortexed for 15 sec and the upper phase is applied to a Biorad 
AG1-X8™ anion exchange resin (100-200 mesh). First, the resin is washed with water at 
1:1.25 W/V and 0.9 ml of upper phase is loaded onto the column. The column is then 
washed with 10 ml of 5 mM myo-inositol and 10 ml of 5 mM Na-borate/60mM Na~ 

25 formate. The inositol tris phosphates are eluted into scintillation vials containing 10 ml of 
scintillation cocktail with 2 ml of 0.1 M formic acid/ 1 M ammonium formate. The 
columns are regenerated by washing with 10 ml of 0.1 M formic acid/3M ammonium 
formate and rinsed twice with dd H 2 0 and stored at 4°C in water. 

Reference is made to Figure 1. In Figure 1, 293 cells were transfected with G q 

30 protein containing a six amino acid deletion, "G q (del)"; G q protein fused to a Q protein, 
"G q (del)/Gi"; endogenous RUP32; and RUP32 with G q (del) ("RUP32+ G q (del)/Gj"). The 
data indicate, based upon measuring D? 3 accumulation of RUP32 co-transfection of 
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G q (del)/Gi, that RUP32 does not endogenously couple to G q protein. 

However when RUP32 was co-transfected with G q (del)/Q fusion protein, RUP32 was 
forced to couple to G q protein. RUP27+ G q (del)/Gi evidence about a nine (9) fold increase 
in IP3 accumulation when compared to endogenous RUP32. This data demonstrates that 
5 the G q (del)/Gi Fusion Construct can be co-transfected with a GPCR and used to screen for 
agonists or inverse agonists. 

Reference is made to Figure 2. In Figure 2, 293 cells were transfected with RUP35 
and RUP36 receptor and compared to the control, pCMV. The data indicate that both 
RUP35 and RUP36 receptor are endogenously, constitutively active. RUP35 evidences 
10 about a six (6) fold increase in intracellular inositol phosphate accumulation when 
compared to pCMV and RUP36 evidences about a four (4) fold increase when compared to 
pCMV. 
Example 5 

Fusion Protein Preparation 
15 a. GPCR: G s Fusion Construct 

The design of the constitutively activated GPCR-G protein fusion construct can be 
accomplished as follows: both the 5' and 3' ends of the rat G protein G s a (long form; Itoh, 
H. et aL, 83 PNAS 3776 (1986)) is engineered to include a Hindm (5'-AAGCTT-3') 

20 sequence thereon. Following confirmation of the correct sequence (including the flanking 
Hindm sequences), the entire sequence is shuttled into pcDNA3.1(-) (Invitrogen, cat. no. 
V795-20) by subcloning using the HindlE restriction site of that vector. The correct 
orientation for the G s a sequence will be determined after subcloning into pcDNA3.1(-). 
The modified pcDNA3.1(-) containing the rat G s oc gene at Hindm sequence is then verified; 

25 this vector will then be available as a '^universal" G s a protein vector. The pcDNA3.1(-) 
vector contains a variety of well-known restriction sites upstream of the Hindm site, thus 
beneficially providing the ability to insert, upstream of the G s protein, the coding sequence 
of an endogenous, constitutively active GPCR. This same approach can be utilized to 
create other "universal" G protein vectors, and, of course, oilier commercially available or 

30 proprietary vectors known to the artisan can be utilized. In some embodiments, the 
important criteria is that the sequence for the GPCR be upstream and in-frame with that of 
the G protein. 
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Spacers in the restriction sites between the G protein and the GPCR are 

optional. The sense and anti-sense primers included the restriction sites for Xbal and 
EcoRV, respectively, such that spacers (attributed to the restriction sites) exist between the 
G protein and the GPCR. 
5 PCR will then be utilized to secure the respective receptor sequences for fusion 

within the G s a universal vector disclosed above, using the following protocol for each: 
lOOng cDNA for GPCR will be added to separate tubes containing 2|d of each primer 
(sense and anti-sense), 3 pi of lOmM dNTPs, 10^1 of lOXTaqPlus™ Precision buffer, lpl of 
TaqPlus™ Precision polymerase (Stratagene: #600211), and 80|nl of water. Reaction 

10 temperatures and cycle times for the GPCR will be as follows with cycle steps 2 through 4 
were repeated 35 times: 94°C for 1 min; 94°C for 30 seconds; 62°C for 20 sec; 72°C 1 min 
40sec; and 72°C 5 min. PCR products will be run on a 1% agarose gel and then purified. 
The purified products will be digested with Xbal and EcoRV and the desired inserts 
purified and ligated into the G s universal vector at the respective restriction sites. The 

15 positive clones will be isolated following transformation and determined by restriction 
enzyme digestion; expression using 293 cells will be accomplished following the protocol 
set forth infra. Each positive clone for GPCR- G s Fusion Protein will be sequenced to 
verify correctness. 

b. G q (6 amino acid deletion)/G] Fusion Construct 

20 The design of a G q (del)/Gi fusion construct was accomplished as follows: the N- 

t.erminal six (6) amino acids (amino acids 2 through 7), having the sequence of TLESIM 
(SEQ.ID.NO.:47) Gocq-subunit was deleted and the C-terminal five (5) amino acids, having 
the sequence EYNLV (SEQ.ID.NO.:48) was replaced with the corresponding amino acids 
of the God Protein, having the sequence DCGLF (SEQ.ID.NO.:49). This fusion construct 

25 was obtained by PCR using the following primers: 

5 '-gatcAAGCTTCCATGGCGTGCTGCCTGAGCGAGG-3' (SEQ.ID.NO.:50) and 
S'-gatcGGATCCTTAGAACAGGCCGCAGTCCTTCAGGTTCAGCTGCAGGATGGTG-S' 
(SEQ.ID.NO.:51) and Plasmid 63313 which contains the mouse Gaq-wild type version with z 
hemagglutinin tag as template. Nucleotides in lower caps are included as spacers. 

30 TaqPlus® Precision DNA polymerase (Stratagene) was utilized for the 

amplification by the following cycles, with steps 2 through 4 repeated 35 times: 95°C for 

2 min; 95°C for 20 sec; 56°C for 20 sec; 72°C for 2 min; and 72°C for 7 min. The PCR 
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product will be cloned into a pCRH-TOPO vector (Invitrogen) and sequenced using the 

ABI Big Dye Terminator kit (P.R Biosystems). Inserts from a TOPO clone containing 
the sequence of the fusion construct will be shuttled into the expression vector 
pcDNA3.1(+) at the Hindm/BamHI site by a 2 step cloning process. 
Example 6 

Tissue Distribution of the disclosed human GPCRs: RT-PCR 



RT-PCR was applied to confirm the expression and to determine the tissue 
distribution of several novel human GPCRs. Oligonucleotides utilized were GPCR- 

10 specific and the human multiple tissue cDNA panels (MTC, Clontech) as templates. Taq 
DNA polymerase (Stratagene) were utilized for the amplification in a 40pl reaction 
according to the manufacturer's instructions. Twenty \il of the reaction will be loaded on 
a 1.5% agarose gel to analyze the RT-PCR products. Table E, below, lists the receptors, 
the cycle conditions and the primers utilized, and also lists exemplary diseases/disorders 

1 5 linked to the receptors. 



TABLE E 



Receptor 
Identifier 


Cycle 
Conditions 
Min ('), Sec (") 
Cycles 2-4 
repeated 35 
times 


5' Primer 
(SEQJD.NO.) 


3' Primer 
(SEQJD.NO.) 


DNA 
Fragment 


Tissue Expression 


hRUP28 


94°Cfor5min; 
94°Cfor30 sec; 
58°Cfor30 sec, 
72°C for 1 min, 
and72°Cfor7 
min 


GTCCTCACT 
GGTGGCCAT 
GTACTCC(52) 


CTGCGTCCAC 

CAGAGTCAC 

GTCTCC(53) 


710bp 


heart; kidney; liver, lung 
and pancreas 


hRUP29 


94°Cfor5min; 
94°Cfor30sec; 
58°Cfor30 sec, 
72°Cfor lmin, 


CTTGGATGTT 
TGGGCTGCC 
CTTCTGC (54) 


GTTTGTGGCT 
AACGGCACA 
AAACACAAT 
TCC(55) 


690bp 


leukocyte and ovary 
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and72°Cfor7 
min 










hRUP30 


94°Cfor2min; 
94°Cforl5sec; 
58°Cfor20sec, 
72°C for 1 min, 
and72°CforlO 
min 


CTGCTCACG 
GTTGACCGC 
TACACTGC 
(56) 


GTGGCCATG 
AGCCACCCT 
GAGCTCC (57) 


690bp 


pancreas 


hRUP31 


95Xfor4min; 
95°C for 1 min; 
52°Cfbr30sec, 
72°C fori min, 
and72°Cfor7 
min 


CTTCTTCTCC 
GACGTCAAG 
G (58) 


CCAAATCA 
GTGTGCAA 
ATCG(59) 


516bp 


colon, lung, pancreas, 
thymus; cerebral cortex, 
hippocampus of brain, 
and fat cells 


hRUP32 


95°Cfor4min; 
95°C for 1 min; 
52°Cfor30 sec, 
72°C for 1 min, 
and72°Cfor7 
min 


TGAATGGGT 
CCTGTGTGA 
AA (60) 


CAACGGTCT 
GACAACCTC 
CT(61) 


527bp 


thymus 


hRUP34 


95°Cfor4min; 
95°C for 1 min; 
52°Cfor30 sec, 
72°Cfor Imin, 
and72°Cfor7 
min 


TTGCTGTGAT 
GTGGCATTTT 
G(62) 


CAGGAAGCC 
CATAAAGGC 
ATCAA(63) 


534bp 


peripheral blood 
leukocyte ("PBL"), 
prostate and kidney 


hRUP35 


95°Cfor4min; 
95°C for 1 min; 
52°Cfor30sec, 
72°C for 1 min, 
and72°Cfor7 
min 


ACATCACCT 
GCTTCCTGA 
CC(64) 


CCAGCATCTT 

GATGCAGTG 

T(65) 


557bp 


thalamus 


hRUP37 


95°Cfor4min; 
95°C for 1 min; 
52°Cfor30sec, 
72°C for 1 min, 


CCATCTCCA 
AAATCCTCA 
GTC(66) 


GCTGTTAAG 
AGCGGACAG 
GAAA (67) 


517bp 


testis, cerebral cortex 
and hippocampus 
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and72°Cfor7 
min 











Diseases and disorders related to receptors located in these tissues or regions 
include, but are not limited to, cardiac disorders and diseases (e.g. thrombosis, myocardial 
infarction; atherosclerosis; cardiomyopathies); kidney disease/disorders (e.g., renal failure; 
5 renal tubular acidosis; renal glycosuria; nephrogenic diabetes insipidus; cystinuria; 
polycystic kidney disease); eosinophilia; leukocytosis; leukopenia; ovarian cancer, sexual 
dysfunction; polycystic ovarian syndrome; pancreatitis and pancreatic cancer; irritable 
bowel syndrome; colon cancer, Crohn's disease; ulcerative colitis; diverticulitis; Chronic 
Obstructive Pulmonary Disease (COPD); Cystic Fibrosis; pneumonia; pulmonary 

10 hypertension; tuberculosis and lung cancer, Parkinson's disease; movement disorders and 
ataxias; learning and memory disorders; eating disorders (e.g., anorexia; bulimia, etc.); 
obesity; cancers; thymoma; myasthenia gravis; circulatory disorders; prostate cancer; 
prostatitis; kidney disease/disorders(e.g., renal failure; renal tubular acidosis; renal 
glycosuria; nephrogenic diabetes insipidus; cystinuria; polycystic kidney disease); 

15 sensorimotor processing and arousal disorders; obsessive-compulsive disorders; testicular 
cancer; priapism; prostatitis; hernia; endocrine disorders; sexual dysfunction; allergies; 
depression; psychotic disorders; migraine; reflux; schizophrenia; ulcers; bronchospasm; 
epilepsy; prostatic hypertrophy; anxiety; rhinitis; angina; and glaucoma. Accordingly, the 
methods of the present invention may also be useful in the diagnosis and/or treatment of 

20 these and other diseases and disorders. 

Example 7 

Protocol: Direct Identification of Inverse Agonists and Agonists 
A. [ 35 S]GTPyS Assay 

25 Although endogenous, constitutively active GPCRs have been used for the direct 

identification of candidate compounds as, e.g., inverse agonists, for reasons that are not 
altogether understood, intra-assay variation can become exacerbated. In some 
embodiments a GPCR Fusion Protein, as disclosed above, is also utilized with a non- 
endogenous, constitutively activated GPCR. When such a protein is used, intra-assay 

30 variation appears to be substantially stabilized, whereby an effective signal-to-noise ratio is 
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obtained. This has the beneficial result of allowing for a more robust identification of 

candidate compounds. Thus, in some embodiments it is preferred that for direct 
identification, a GPCR Fusion Protein be used and that when utilized, the following assay 
protocols be utilized. 
5 1. Membrane Preparation 

Membranes comprising the constitutively active orphan GPCR Fusion Protein of 
interest and for use in the direct identification of candidate compounds as inverse agonists 
or agonists are preferably prepared as follows: 

a. Materials 

10 'Membrane Scrape Buffer" is comprised of 20mM HEPES and lOmM EDTA, 

pH 7.4; 'Membrane Wash Buffer" is comprised of 20 mM HEPES and 0.1 mM EDTA, 
pH 7.4; 'Binding Buffer" is comprised of 20mM HEPES, 100 mM NaCl, and 10 mM 
MgCl 2 ,pH7.4 

b. Procedure 

15 All materials will be kept on ice throughout the procedure. Firstly, the media will 

be aspirated from a confluent monolayer of cells, followed by rinse with 10ml cold PBS, 
followed by aspiration. Thereafter, 5ml of Membrane Scrape Buffer will be added to scrape 
cells; this will be followed by transfer of cellular extract into 50ml centrifuge tubes 
(centrifiiged at 20,000 rpm for 17 minutes at 4°C). Thereafter, the supernatant will be 

20 aspirated and the pellet will be resuspended in 30ml Membrane Wash Buffer followed by 
centrifugation at 20,000 ipm for 17 minutes at 4°C. The supernatant will then be aspirated 
and the pellet resuspended in Binding Buffer. The resuspended pellet will then be 
homogenized using a Brinkman Polytron™ homogenizer (15-20 second bursts until the 
material is in suspension). This is referred to herein as "Membrane Protein". 

25 2. Bradford Protein Assay 

Following the homogenization, protein concentration of the membranes will be 
determined, for example, using the Bradford Protein Assay (protein can be diluted to 
about 1.5mg/ml, aliquoted and frozen (-80°C) for later use; when frozen, protocol for use 
will be as follows: on the day of the assay, frozen Membrane Protein is thawed at room 

30 temperature, followed by vortex and then homogenized with a Polytron at about 12 x 
1,000 rpm for about 5-10 seconds; it was noted that for multiple preparations, the 
homogenizer is thoroughly cleaned between homogenization of different preparations). 
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a. Materials 

Binding Buffer (as discussed above); Bradford Dye Reagent; Bradford Protein 
Standard will be utilized, following manufacturer instructions (Biorad, cat. no. 500- 
0006). 

5 b. Procedure 

Duplicate tubes will be prepared, one including the membrane, and one as a 
control "blank". Each contains 800pl Binding Buffer. Thereafter, 10pl of Bradford 
Protein Standard (lmg/ml) will be added to each tube, and 10|iil of membrane Protein 
will then be added to just one tube (not the blank). Thereafter, 200pl of Bradford Dye 
10 Reagent will be added to each tube, followed by vortexing. After five minutes, the tubes 
will be re-vortexed and the material therein will be transferred to cuvettes. The cuvettes 
will then be read using a CECIL 3041 spectrophotometer, at wavelength 595. 
3. Direct Identification Assay 

a. Materials 

1 5 GDP Buffer consisted of 37.5 ml Binding Buffer and 2mg GDP (Sigma, cat. no. G- 

7127), followed by a series of dilutions in Binding Buffer to obtain 0.2 pM GDP (final 
concentration of GDP in each well was 0.1 pM GDP); each well comprising a candidate 
compound, has a final volume of 200pl consisting of IOOjllI GDP Buffer (final 
concentration, O.lpM GDP), 50pl Membrane Protein in Binding Buffer, and 50pl 

20 [ 35 S]GTPyS (0.6 nM) in Binding Buffer (2.5 jol [ 35 S]GTPyS per 10ml Binding Buffer). 

b. Procedure 

Candidate compounds will be preferably screened using a 96-well plate format 
(these can be frozen at -80°C). Membrane Protein (or membranes with expression vector 
excluding the GPCR Fusion Protein, as control), will be homogenized briefly until in 

25 suspension. Protein concentration will then be determined using, for example, the Bradford 
Protein Assay set forth above. Membrane Protein (and controls) will then be diluted to 
0.25mg/ml in Binding Buffer (final assay concentration, 12.5pg/well). Thereafter, 100 pi 
GDP Buffer is added to each well of a Wallac Scintistrip™ (Wallac). A 5pl pin-tool will 
then be used to transfer 5 pl of a candidate compound into such well (z.e., 5pl in total assay 

30 volume of 200 pl is a 1 :40 ratio such that the final screening concentration of the candidate 
compound is IOjuM). Again, to avoid contamination, after each transfer step the pin tool is 
rinsed in three reservoirs comprising water (IX), ethanol (IX) and water (2X) - excess 
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liquid is shaken from the tool after each rinse and the tool is dried with paper and Kim 

wipes. Thereafter, 50 pi of Membrane Protein will be added to each well (a control well 

comprising membranes without the GPCR Fusion Protein was also utilized), and pre- 

incubated for 5-10 minutes at room temperature. Thereafter, 50 pi of [ 35 S]GTPyS (0.6 nM) 

5 in Binding Buffer will be added to each well, followed by incubation on a shaker for 60 

minutes at room temperature (again, in this example, plates were covered with foil). The 

assay will be stopped by spinning the plates at 4000 RPM for 15 minutes at 22°C. The 

plates will then be aspirated with an 8 channel manifold and sealed with plate covers. The 

plates will then be read on a Wallac 1450 using setting "Prot. #37" (as per manufacturer's 

10 instructions). 

B. Cyclic AMP Assay 

Another assay approach to directly identify candidate compound will be 
accomplished utilizing a cyclase-based assay. In addition to direct identification, this assay 

15 approach can be utilized as an independent approach to provide confirmation of the results 
from the [ 35 S]GTPyS approach as set forth above. 

A modified Flash Plate™ Adenylyl Cyclase kit (New England Nuclear; Cat. No. 
SMP004A) will be preferably utilized for direct identification of candidate compounds as 
inverse agonists and agonists to GPCRs in accordance with the following protocol. 

20 Transfected cells will be harvested approximately three days after transfection. 

Membranes will be prepared by homogenization of suspended cells in buffer containing 
20mM HEPES, pH 7.4 and lOmM MgCk. Homogenization will be performed on ice using 
a Brinkman Polytron™ for approximately 10 seconds. The resulting homogenate will be 
centrifuged at 49,000 X g for 15 minutes at 4°C. The resulting pellet will then be 

25 resuspended in buffer containing 20mM HEPES, pH 7.4 and 0. 1 mM EDTA, homogenized 
for 10 seconds, followed by centrifugation at 49,000 X g for 15 minutes at 4°C. The 
resulting pellet will then be stored at -80°C until utilized. On the day of direct identification 
screening, the membrane pellet will slowly be thawed at room temperature, resuspended in 
buffer containing 20mM HEPES, pH 7.4 and lOmM MgCl 2 , to yield a final protein 

30 concentration of 0.60mg/ml (the resuspended membranes will be placed on ice until use). 

cAMP standards and Detection Buffer (comprising 2 p.Ci of tracer [ 125 I cAMP (100 
to 11 ml Detection Buffer) will be prepared and maintained in accordance with the 
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manufacturer's instructions. Assay Buffer will be prepared fresh for screening and 

contain 20mM HEPES, pH 7.4, lOmM MgCl 2 , 20mM phosphocreatine (Sigma), 0.1 
units/ml creatine phosphokinase (Sigma), 50 pM GTP (Sigma), and 0.2 mM ATP (Sigma); 
Assay Buffer will be stored on ice until utilized. 
5 Candidate compounds identified as per above (if frozen, thawed at room 

temperature) will be added, preferably, to 96-well plate wells (3|xl/well; 12|iM final assay 
concentration), together with 40 jil Membrane Protein (30jig/well) and 50j±l of Assay 
Buffer. This admixture will be incubated for 30 minutes at room temperature, with gentle 
shaking. 

10 Following the incubation, lOOjil of Detection Buffer will be added to each well, 

followed by incubation for 2-24 hours. Plates will then be counted in a Wallac 
MicroBeta™ plate reader using 'Trot. #31" (as per manufacturer instructions). 
C. Melanophore Screening Assay 

A method for identifying candidate agonists or inverse agonists for a GPCR can be 
15 preformed by introducing tests cells of a pigment cell line capable of dispersing or 
aggregating their pigment in response to a specific stimulus and expressing an exogenous 
clone coding for the GCPR. A stimulant, e.g., fight, sets an initial state of pigment 
disposition wherein the pigment is aggregated within the test cells if activation of the GPCR 
induces pigment dispersion. However, stimulating the cell with a stimulant to set an initial 
20 state of pigment disposition wherein the pigment is dispersed if activation of the GPCR 
induces pigment aggregation. The tests cells are then contacted with chemical compounds, 
and it is determined whether the pigment disposition in the cells changed from the initial 
state of pigment disposition. Dispersion of pigments cells due to the candidate compound 
coupling to the GPCR will appear dark on a petri dish, while aggregation of pigments cells 
25 will appear fight. 

Materials and methods will be followed according to the disclosure of U.S. Patent 
Number 5,462,856 and U.S. Patent Number 6,051,386, each of which are incorporated by 
reference. 

Although a variety of expression vectors are available to those in the art, for 
30 purposes of utilization for both the endogenous and non-endogenous human GPCRs, in 
some embodiments it is preferred that the vector utilized be pCMV. This vector was 
deposited with the American Type Culture Collection (ATCC) on October 13, 1998 (10801 
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University Blvd., Manassas, VA 20110-2209 USA) under the provisions of the Budapest 

Treaty for tihe International Recognition of the Deposit of Microorganisms for the Purpose 
of Patent Procedure. The DNA was tested by the ATCC and determined to be viable. The 
ATCC has assigned the following deposit number to pCMV: ATCC #203351. 
5 References cited throughout this patent document, including co-pending and related 

patent applications, unless otherwise indicated, are fully incorporated herein by reference. 
Modifications and extension of the disclosed inventions that are within the purview of the 
skilled artisan are encompassed within the above disclosure and the claims that follow. 
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CLAIMS 

What is claimed is: 

1 . AG protein-coupled receptor encoded by an amino acid sequence of 
5 SEQJ0D.NO.:2. 

2. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 1. 

3. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.: 1. 

4. A host cell comprising the plasmid of claim 3. 

10 5 . A G protein-coupled receptor encoded by an amino acid sequence of 
SEQID.NO.:4. 

6. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claimS. 

7. A plasmid comprising a vector and the cDNA of SEQ.ID.NO. :3 . 
15 8. A host cell comprising the plasmid of claim 7, 

9. AG protein-coupled receptor encoded by an amino acid sequence of 
SEQID.NO.:6. 

10. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 9. 

20 11. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:5. 

12. A host cell comprising the plasmid of claim 11. 

13. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:8. 

14. A non-endogenous, constitutively activated version of the G protein-coupled 
25 receptor of claim 1 3 . 

15. A plasmid comprising a vector and the cDNA of SEQ.E).NO.:7. 

16. A host cell comprising the plasmid of claim 15. 

17. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.: 10. 

30 1 8. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 17 . 
19. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:9. 
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20. A host cell comprising the plasmid of claim 19. 

21. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:12. 

22. A non-endogenous, constitutively activated version of the G protein-coupled 
5 receptor of claim 2 1 . 

23 . A plasmid comprising a vector and the cDNA of SEQ.ID.NO. : 1 1 . 

24. A host cell comprising the plasmid of claim 23. 

25. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.nXNO.:K 

10 26. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 25. 

27. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.:13. 

28. A host cell comprising the plasmid of claim 27. 

29. A G protein-coupled receptor encoded by an amino acid sequence of 
15 SEQ.ID.NO.:16. 

30. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 29. 

3 1 . A plasmid comprising a vector and the cDNA of SEQ.ID.NO. : 1 5. 

32. A host cell comprising the plasmid of claim 3 1 . 

20 33 . A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:18. 

34. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 33. 

35. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.: 17. 
25 36. A host cell comprising the plasmid of claim 35. 

37. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO.:20. 

38. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 37. 

30 39. Aplasmid comprising a vector and the cDNA of SE.ID.NO.:19. 
40. A host cell comprising the plasmid of claim 39. 
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SEQUENCE LISTING 

<110> Arena Pharmaceuticals, Inc. 

<120> Endogenous And Non-Endogenous, Constitutively Activated Human G Protein-Coupled 
Receptors 

<130> AREN-0309 

<150> 09/170,496 

<151> 1998-10-13 

<150> PCT/US99/23938 

<151> 1998-10-13 

<150> 60/253,404 

<151> 2000-11-27 

<150> 60/255,366 

<151> 2000-12-12 

<150> 60/270,286 

<151> 2001-02-20 

<150> 60/282,365 

<151> 2001-04-06 

<150> 60/270,266 

<151> 2001-02-20 

<150> 60/282,032 

<151> 2001-04-06 

<150> 60/282,358 

<151> 2001-04-06 

<150> 60/282,356 

<151> 2001-04-06 

<150>, 60/290, 917 

<151> 2001-05-14 

<150> 60/309,208 

<151> 2001-07-31 

<160> 67 

<170> Patentln version- 3.1 

<210> 1 

<211> 1002 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 1 

atgtggagct gcagctggtt caacggcaca gggctggtgg aggagctgcc tgcctgccag 60 
gacctgcagc tggggctgtc actgttgtcg ctgctgggcc tggtggtggg cgtgccagtg 120 
ggcctgtgct acaacgccct gctggtgctg gccaacctac acagcaaggc cagcatgacc 180 
atgccggacg tgtactttgt caacatggca gtggcaggcc tggtgctcag cgccctggcc 240 
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cctgtgcacc 


tgctcggccc 


cccgagctcc 


cggtgggcgc tgtggagtgt gggcggcgaa 


300 


gtccacgtgg 


cactgcagat 


ccccttcaat 


gtgtcctcac tggtggccat gtactccacc 


360 


gccctgctga 


gcctcgacca 


ctacatcgag 


cgtgcactgc cgcggaccta catggccagc 


420 


gtgtacaaca 


cgcggcacgt 


gtgcggcttc 


gtgtggggtg gcgcgctgct gaccagcttc 


480 


tcctcgctgc 


tcttctacat 


ctgcagccat 


gtgtccaccc gcgcgctaga gtgcgccaag 


540 


atgcagaacg 


cagaagctgc 


cgacgccacg 


ctggtgttca tcggctacgt ggtgccagca 


600 


ctggccaccc 


tctacgcgct 


ggtgctactc 


tcccgcgtcc gcagggagga cacgcccctg 


660 


gaccgggaca 


cgggccggct 


ggagccctcg 


gcacacaggc tgctggtggc caccgtgtgc 


720 


acgcagtttg 


ggctctggac 


gccacactat 


ctgatcctgc tggggcacac ggtcatcatc 


780 


tcgcgaggga 


agcccgtgga 


cgcacactac 


ctggggctac tgcactttgt gaaggatttc 


840 


tccaaactcc 


tggccttctc 


cagcagcttt 


gtgacaccac ttctctaccg ctacatgaac 


900 


cagagcttcc 


ccagcaagct 


ccaacggctg 


atgaaaaagc tgccctgcgg ggaccggcac 


960 


tgctccccgg 


accacatggg 


ggtgcagcag 


gtgctggcgt ag 


1002 



<210> 2 

<211> 333 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 2 

Met Trp Ser Cys Ser Trp Phe Asn Gly Thr Gly Leu Val Glu Glu Leu 
1 5 10 15 

Pro Ala Cys Gin Asp Leu Gin Leu Gly Leu Ser Leu Leu Ser Leu Leu 
20 25 30 

Gly Leu Val Val Gly Val Pro Val Gly Leu Cys Tyr Asn Ala Leu Leu 
35 40 45 

Val Leu Ala Asn Leu His Ser Lys Ala Ser Met Thr Met Pro Asp Val 
50 55 60 

Tyr Phe Val Asn Met Ala Val Ala Gly Leu Val Leu Ser Ala Leu Ala 
65 70 75 80 

Pro Val His Leu Leu Gly Pro Pro Ser Ser Arg Trp Ala Leu Trp Ser 
85 90 95 

Val Gly Gly Glu Val His Val Ala Leu Gin He Pro Phe Asn Val Ser 
100 105 110 

Ser Leu Val Ala Met Tyr Ser Thr Ala Leu Leu Ser Leu Asp His Tyr 
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115 



120 



125 



He Glu Arg Ala Leu Pro Arg Thr Tyr Met Ala Ser Val Tyr Asn Thr 
130 135 140 



Arg His Val Cys Gly Phe Val Trp Gly Gly Ala Leu Leu Thr Ser Phe 
145 150 155 160 



Ser Ser Leu Leu Phe Tyr He Cys Ser His Val Ser Thr Arg Ala Leu 
165 170 175 



Glu Cys Ala Lys Met Gin Asn Ala Glu Ala Ala Asp Ala Thr Leu Val 
180 185 190 



Phe He Gly Tyr Val Val Pro Ala Leu Ala Thr Leu Tyr Ala Leu Val 
195 200 205 



Leu Leu Ser Arg Val Arg Arg Glu Asp thr Pro Leu Asp Arg Asp Thr 
210 215 220 



Gly Arg Leu Glu Pro Ser Ala His Arg Leu Leu Val Ala Thr Val Cys 
225 230 235 240 



Thr Gin Phe Gly Leu Trp Thr Pro His Tyr Leu He Leu Leu Gly His 
245 250 255 



Thr Val He He Ser Arg Gly Lys Pro Val Asp Ala His Tyr Leu Gly 
260 265 " 270 



Leu Leu His Phe Val Lys Asp Phe Ser Lys Leu Leu Ala Phe Ser Ser 
275 280 285 



Ser Phe Val Thr Pro Leu Leu Tyr Arg Tyr Met Asn Gin Ser Phe Pro 
290 295 300 



Ser Lys Leu Gin Arg Leu Met Lys Lys Leu Pro Cys Gly Asp Arg His 
305 310 315 320 



Cys Ser Pro Asp His Met Gly Val Gin Gin Val Leu Ala 
325 330 



<210> 3 

<211> 918 

<212> DNA 

<213> Homo sapiens 

<400> 3 

atgcctggcc acaatacctc caggaattcc tcttgcgatc ctatagtgac accccactta 

atcagcctct acttcatagt gcttattggc gggctggtgg gtgtcatttc cattcttttc 

ctcctggtga aaatgaacac ccggtcagtg accaccatgg cggtcattaa cttggtggtg 
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gtccacagcg 


tttttctgct 


gacagtgcca 


tttcgcttga cctacctcat caagaagact 


240 


tggatgtttg 


ggctgccctt 


ctgcaaattt 


gtgagtgcca tgctgcacat ccacatgtac 


300 


ctcacgttcc 


tattctatgt 


ggtgatcctg 


gtcaccagat acctcatctt cttcaagtgc 


360 


aaagacaaag 


tggaattcta 


cagaaaactg 


catgctgtgg ctgccagtgc tggcatgtgg 


420 


acgctggtga 


ttgtcattgt 


ggtacccctg 


gttgtctccc ggtatggaat ccat.gaggaa 


480 


tacaatgagg 


agcactgttt 


taaatttcac 


aaagagcttg cttacacata tgtgaaaatc 


540 


atcaactata 


tgatagtcat 


ttttgtcata 


gccgttgctg tgattctgtt ggtcttccag 


600 


gtcttcatca 


ttatgttgat 


ggtgcagaag 

ZJ -J ZJ J ZJ 


ctacgccact ctttactatc ccaccaggag 


660 


ttctgggctc 


agctgaaaaa 


cctatttttt 


ataggggtca tccttgtttg tttccttccc 


720 


taccagttct 


ttaggatcta 


ttacttgaat 


gttgtgacgc attccaatgc ctgtaacagc 


780 


aaggttgcat 


tttataacga 


aatcttcttg 


agtgtaacag caattagctg ctatgatttg 


840 


cttctctttg 


tctttggggg 


aagccattgg 


tttaagcaaa agataattgg cttatggaat 


900 


tgtgttttgt 


gccgttag 






918 



<210> 4 

<211> 305 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Pro Gly His Asn Thr Ser Arg Asn Ser Ser Cys Asp Pro lie Val 
1 5 10 15 



Thr Pro His Leu He Ser Leu Tyr Phe He Val Leu He Gly Gly Leu 
20 25 30 



Val Gly Val He Ser He Leu Phe Leu Leu Val Lys Met Asn Thr Arg 
35 40 45 



Ser Val Thr Thr Met Ala Val He Asn Leu Val Val Val His Ser Val 
50 55 60 



Phe Leu Leu Thr Val Pro Phe Arg Leu Thr Tyr Leu He Lys Lys Thr 
65 70 75 80 



Trp Met Phe Gly Leu Pro Phe Cys Lys Phe Val Ser Ala Met Leu His 
85 90 95 



He His Met Tyr Leu Thr Phe Leu Phe ?yr Val Val He Leu Val Thr 
100 105 110 



Arg Tyr Leu He Phe Phe Lys Cys Lys Asp Lys Val Glu Phe Tyr Arg 
115 120 125 
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Lys Leu His Ala Val Ala Ala Ser Ala Gly Met Trp Thr Leu Val He 
130 135 140 



Val He Val Val Pro Leu Val Val Ser Arg Tyr Gly He His Glu Glu 
145 150 155 160 



Tyr Asn Glu Glu His Cys Phe Lys Phe His Lys Glu Leu Ala Tyr Thr 
165 170 175 



Tyr Val Lys He He Asn Tyr Met He Val He Phe Val He Ala Val 
180 185 190 



Ala Val He Leu Leu Val Phe Gin Val Phe He He Met Leu Met Val 
195 200 205 



Gin Lys Leu Arg His Ser Leu Leu Ser His Gin Glu Phe Trp Ala Gin 
210 215 220 



Leu Lys Asn Leu Phe Phe He Gly Val lie Leu Val Cys Phe Leu Pro 
225 230 235 240 



Tyr Gin Phe Phe Arg lie Tyr Tyr Leu Asn Val Val Thr His Ser Asn 
245 250 255 



Ala Cys Asn Ser Lys Val Ala Phe Tyr Asn Glu lie Phe Leu Ser Val 
260 265 270 



Thr Ala He Ser Cys Tyr Asp Leu Leu Leu Phe Val Phe Gly Gly Ser 
275 280 285 



His Trp Phe Lys Gin Lys lie lie Gly Leu Trp Asn Cys Val Leu Cys 
290 295 300 



Arg 
305 



<210> 5 
<211> 1125 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 
<400>' 5 

atgctgacag ggagctgcgg ggaccctcag aaaaagccac aggtgaccca ggactcaggg 60 
ccccagagca tggggcttga gggacgagag acagctggcc agccacgagt gaccctgctg 120 
cccacgccca acgtcagcgg gctgagccag gagtttgaaa gccactggcc agagatcgca 180 
gagaggtccc cgtgtgtggc tggcgtcatc cctgtcatct actacagtgt cctgctgggc 240 
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ttggggctgc ctgtcagcct cctgaccgca gtggccctgg cgcgccttgc caccaggacc 300 

aggaggccct cctactacta ccttctggcg ctcacagcct cggatatcat catccaggtg 360 

gtcatcgtgt tcgcgggctt cctcctgcag ggagcagtgc tggcccgcca ggtgccccag 420 

gctgtggtgc gcacggccaa catcctggag tttgctgcca accacgcctc agtctggatc 480 

gccatcctgc tcacggttga ccgctacact gccctgtgcc accccctgca ccatcgggcc 54 0 

gcctcgtccc caggccggac ccgccgggcc attgctgctg tcctgagtgc tgccctgttg 600 

accggcatcc ccttctactg gtggctggac atgtggagag acaccgactc acccagaaca 660 

ctggacgagg tcctcaagtg ggctcactgt ctcactgtct atttcatccc ttgtggcgtg 720 

ttcctggtca ccaactcggc catcatccac cggctacgga ggaggggccg gagtgggctg 780 

cagccccggg tgggcaagag cacagccatc ctcctgggca tcaccacact gttcaccctc 840 

ctgtgggcgc cccgggtctt cgtcatgctc taccacatgt acgtggcccc tgtccaccgg 900 

gactggaggg tccacctggc cttggatgtg gccaatatgg tggccatgct ccacacggca 960 

gccaacttcg gcctctactg ctttgtcagc aagactttcc gggccactgt ccgacaggtc 1020 

atccacgatg cctacctgcc ctgcactttg gcatcacagc cagagggcat ggcggcgaag 1080 

cctgtgatgg agcctccggg actccccaca ggggcagaag tgtag 1125 

<210> 6 

<211> 374 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 6 

Met Leu Thr Gly Ser Cys Gly Asp Pro Gin Lys Lys Pro Gin Val Thr 



Gin Asp Ser Gly Pro Gin Ser Met Gly Leu Glu Gly Arg Glu Thr Ala 
20 25 30 

Gly Gin Pro Arg Val Thr Leu Leu Pro Thr Pro Asn Val Ser Gly Leu 
35 40 45 

Ser Gin Glu Phe Glu Ser His Trp Pro Glu lie Ala Glu Arg Ser Pro 
50 55 60 

Cys Val Ala Gly Val lie Pro Val lie Tyr Tyr Ser Val Leu Leu Gly 
65 70 " 75 80 

Leu Gly Leu Pro Val Ser Leu Leu Thr Ala Val Ala Leu Ala Arg Leu 
85 90 . 95 

Ala Thr Arg Thr Arg Arg Pro Ser Tyr Tyr Tyr Leu Leu Ala Leu Thr 
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100 105 110 



Ala Ser Asp lie lie He Gin Val Val He Val Phe Ala Gly Phe Leu 
115 120 125 



Leu Gin Gly Ala Val Leu Ala Arg Gin Val Pro Gin Ala Val Val Arg 
130 135 140 



Thr Ala Asn He Leu Glu Phe Ala Ala Asn His Ala Ser Val Trp He 
145 150 155 160 



Ala He Leu Leu Thr Val Asp Arg Tyr Thr Ala Leu Cys His Pro Leu 
165 170 175 



His His Arg Ala Ala Ser Ser Pro Gly Arg Thr Arg Arg Ala He Ala 
180 185 190 



Ala Val Leu Ser Ala Ala Leu Leu Thr Gly He Pro Phe Tyr Trp Trp 
195 200 205 



Leu Asp Met Trp Arg Asp Thr Asp Ser Pro Arg Thr Leu Asp Glu Val 
210 215 220 



Leu Lys Trp Ala His Cys Leu Thr Val Tyr Phe He Pro Cys Gly Val 
225 ~ 230 235 240 



Phe Leu Val Thr Asn Ser Ala He He His Arg Leu Arg Arg Arg Gly 
245 250 255 



Arg Ser Gly Leu Gin Pro Arg Val Gly Lys Ser Thr Ala He Leu Leu 
260 265 270 



Gly He Thr Thr Leu Phe Thr Leu Leu Trp Ala Pro Arg Val Phe Val 
275 280 285 



Met Leu Tyr His Met Tyr Val Ala Pro Val His Arg Asp Trp Arg Val 
290 295 300 



His Leu Ala Leu Asp Val Ala Asn Met Val Ala Met Leu His Thr Ala 
305 310 315 320 



Ala Asn Phe Gly Leu Tyr Cys Phe Val Ser Lys Thr Phe Arg Ala Thr 
325 330 335 



Val Arg Gin Val He His Asp Ala Tyr Leu Pro Cys Thr Leu Ala Ser 
340 345 350 



Gin Pro Glu Gly Met Ala Ala Lys Pro Val Met Glu Pro Pro Gly Leu 
355 360 365 
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Pro Thr Gly Ala Glu Val 
370 

<210> 7 

<211> 1086 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 



<400> 7 
atgtccactg 


aatgcgcgcg 


ggcagcgggc 


gacgcgccct tgcgcagcct 


ggagcaagcc 


60 


aaccgcaccc 


gctttccctt 


cttctccgac 


gtcaagggcg accaccggct 


ggtgctggcc 


120 


gcggtggaga 


caaccgtgct 


ggtgctcatc 


tttgcagtgt cgctgctggg 


caacgtgtgc 


180 


gccctggtgc 


tggtggcgcg 

3 3 3 3 3 3 


ccgacgacgc 


cgcggcgcga ctgcctgcct 


ggtactcaac 


240 


ctcttctgcg 


cggacctgct 


cttcatcagc 


gctatccctc tggtgctggc 


cgtgcgctgg 


300 


actgaggcct 


ggctgctggg 


ccccgttgcc 


tgccacctgc tcttctacgt 


gatgaccctg 


3 60 


agcggcagcg 


tcaccatcct 


cacgctggcc 


gcggtcagcc tggagcgcat 


ggtgtgcatc 


420 


gtgcacctgc 


agcgcggcgt 


gcggggtcct 


gggcggcggg cgcgggcagt 


gctgctggcg 


480 


ctcatatggg 


gctattcggc 


ggtcgccgct 


ctgcctctat gcgtcttctt 


ccgagtcgtc 


540 


ccgcaacggc 


tccccggcgc 


cgaccaggaa 


atttcgattt gcacactgat 


ttggcccacc 


600 


attcctggag 


agatctcgtg 


ggatgtctct 


tttgttactt tgaacttctt ggtgccagga 


660 


ctggtcattg 


tgatcagtta 


ctccaaaatt 


ttacagatca caaaggcatc 


aaggaagagg 


720 


ctcacggtaa 


gcctggccta 


ctcggagagc 


caccagatcc gcgtgtccca gcaggacttc 


780 


cggctcttcc 


gcaccctctt 


cctcctcatg 


gtctccttct tcatcatgtg gagccccatc 


840 


atcatcacca 


tcctcctcat 


cctgatccag 


aacttcaagc aagacctggt 


catctggccg 


900 


tccctcttct 


tctgggtggt 


ggccttcaca 


tttgctaatt cagccctaaa ccccatcctc 


960 


tacaacatga 


cactgtgcag 


gaatgagtgg 


aagaaaattt tttgctgctt 


ctggttccca 


1020 


gaaaagggag 


ccattttaac 


agacacatct 


gtcaaaagaa atgacttgtc gattatttct 


1080 


ggctaa 










1086 



<210> 8 

<211> 361 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 8 

Met Ser Thr Glu Cys Ala Arg Ala Ala Gly Asp Ala Pro Leu Arg Ser 
1 5 10 15 
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Leu Glu Gin Ala Asn Arg Thr Arg Phe Pro Phe Phe Ser Asp Val Lys 
20 25 30 



Gly Asp His Arg Leu Val Leu Ala Ala Val Glu Thr Thr Val Leu Val 
35 40 45 



Leu lie Phe Ala Val Ser Leu Leu Gly Asn Val Cys Ala Leu Val Leu 
50 55 60 



Val Ala Arg Arg Arg Arg Arg Gly Ala Thr Ala Cys Leu Val Leu Asn 
65 70 75 80 



Leu Phe Cys Ala Asp Leu Leu Phe lie Ser Ala lie Pro Leu Val Leu 
85 90 95 



Ala Val Arg Trp Thr Glu Ala Trp Leu Leu Gly Pro Val Ala Cys His 
100 105 110 



Leu Leu Phe Tyr Val Met Thr Leu Ser Gly Ser Val Thr He Leu Thr 
115 120 125 



Leu Ala Ala Val Ser Leu Glu Arg Met Val Cys He Val His Leu Gin 
130 135 140 



Arg Gly Val Arg Gly Pro Gly Arg Arg Ala Arg Ala Val Leu Leu Ala 
145 150 " " 155 160 



Leu He Trp Gly Tyr Ser Ala Val Ala Ala Leu Pro Leu Cys Val Phe 
165 170 175 



Phe Arg Val Val Pro Gin Arg Leu Pro Gly Ala Asp Gin Glu lie Ser 
180 185 190 



He Cys Thr Leu He Trp Pro Thr He Pro Gly Glu He Ser Trp Asp 
195 200 205 



Val Ser Phe Val Thr Leu Asn Phe Leu Val Pro Gly Leu Val He Val 
210 215 220 



He Ser Tyr Ser Lys He Leu Gin He Thr Lys Ala Ser Arg Lys Arg 
225 ' 230 235 " 240 



Leu Thr Val Ser Leu Ala Tyr Ser Glu Ser His Gin He Arg Val Ser 
245 250 255 



Gin Gin Asp Phe Arg Leu Phe Arg Thr Leu Phe Leu Leu Met Val Ser 
260 265 270 



Phe Phe He Met Trp Ser Pro He He He Thr He Leu Leu He Leu 
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275 280 285 

He Gin Asn Phe Lys Gin Asp Leu Val He Trp Pro Ser Leu Phe Phe 
290 295 300 

Trp Val Val Ala Phe Thr Phe Ala Asn Ser Ala Leu Asn Pro He Leu 
305 310 315 320 

Tyr Asn Met Thr Leu Cys Arg Asn Glu Trp Lys Lys He Phe Cys Cys 
325 330 335 

Phe Trp Phe Pro Glu Lys Gly Ala He Leu Thr Asp Thr Ser Val Lys 
340 345 350 

Arg Asn Asp Leu Ser He He Ser Gly 
355 360 

<210> 9 
<211> 1038 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 
<400> 9 



atgagcagca 


attcatccct 


gctggtggct gtgcagctgt gctacgcgaa 


cgtgaatggg 


60 


tcctgtgtga 


aaatcccctt 


ctcgccggga tcccgggtga ttctgtacat 


agtgtttggc 


120 


tttggggctg 


tgctggctgt 


gtttggaaac ctcctggtga tgatttcaat 


cctccatttc 


180 


aagcagctgc 


actctccgac 


caattttctc gttgcctctc tggcctgcgc 


tgatttcttg 


240 


gtgggtgtga 


ctgtgatgcc 


cttcagcatg gtcaggacgg tggagagctg 


ctggtatttt 


300 


gggaggagtt 


tttgtacttt 


ccacacctgc tgtgatgtgg cattttgtta 


ctcttctctc 


360 


tttcacttgt 


gcttcatctc 


catcgacagg tacattgcgg ttactgaccc 


cctggtctat 


420 


cctaccaagt 


tcaccgtatc 


tgtgtcagga atttgcatca gcgtgtcctg 


gatcctgccc 


480 


ctcatgtaca 


gcggtgctgt 


gttctacaca ggtgtctatg acgatgggct 


ggaggaatta 


540 


tctgatgccc 


taaactgtat 


aggaggttgt cagaccgttg taaatcaaaa 


ctgggtgttg 


600 


acagattttc 


tatccttctt 


tatacctacc tttattatga taattctgta 


tggtaacata 


660 


tttcttgtgg 


ctagacgaca 


ggcgaaaaag atagaaaata ctggtagcaa 


gacagaatca 


720 


tcctcagaga 


gttacaaagc 


cagagtggcc aggagagaga gaaaagcagc 


taaaaccctg 


780 


ggggtcacag 


tggtagcatt 


tatgatttca tggttaccat atagcattga 


ttcattaatt 


840 


gatgccttta 


tgggctttat 


aacccctgcc tgtatttatg agatttgctg 


ttggtgtgct 


900 


tattataact 


cagccatgaa 


tcctttgatt tatgctttat tttacccatg 


gtttaggaaa 


960 


gcaataaaag 


ttattgtaac 


tggtcaggtt ttaaagaaca gttcagcaac 


catgaatttg 


1020 
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ttttctgaac atatataa 



1038 



<210> 


10 


<211> 


345 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Novel Sequence 


<400> 


10 



Met Ser Ser Asn Ser Ser Leu Leu Val Ala Val Gin Leu Cys Tyr Ala 
15 10 15 



Asn Val Asn Gly Ser Cys Val Lys lie Pro Phe Ser Pro Gly Ser Arg 
20 25 30 



Val lie Leu Tyr He Val Phe Gly Phe Gly Ala Val Leu Ala Val Phe 
35 40 45 



Gly Asn Leu Leu Val Met He Ser He Leu His Phe Lys Gin Leu His 
50 55 60 



Ser Pro Thr Asn Phe Leu Val Ala Ser Leu Ala Cys Ala Asp Phe Leu 
65 70 75 80 



Val Gly Val Thr Val Met Pro Phe Ser Met Val Arg Thr Val Glu Ser 
85 90 95 



Cys Trp Tyr Phe Gly Arg Ser Phe Cys Thr Phe His Thr Cys Cys Asp 
100 105 110 



Val Ala Phe Cys Tyr Ser Ser Leu Phe His Leu Cys Phe He Ser He 
115 120 - 125 



Asp Arg Tyr He Ala Val Thr Asp Pro Leu Val Tyr Pro Thr Lys Phe 
130 135 140 



Thr Val Ser Val Ser Gly He Cys He Ser Val Ser Trp He Leu Pro 
145 150 155 160 



Leu Met Tyr Ser Gly Ala Val Phe Tyr Thr Gly Val Tyr Asp Asp Gly 
165 170 175 



Leu Glu Glu Leu Ser Asp Ala Leu Asn Cys He Gly Gly Cys Gin Thr 
180 185 * 190 



Val Val Asn Gin Asn Trp Val Leu Thr Asp Phe Leu Ser Phe Phe He 
195 200 205 



Pro Thr Phe He Met He lie Leu Tyr Gly Asn He Phe Leu Val Ala 
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210 215 220 



Arg Arg Gin Ala Lys Lys lie Glu Asn Thr Gly Ser Lys Thr Glu Ser 
225 230 235 240 



Ser Ser Glu Ser Tyr Lys Ala Arg Val Ala Arg Arg Glu Arg Lys Ala 
245 250 255 



Ala Lys Thr Leu Gly Val Thr Val Val Ala Phe Met lie Ser Trp Leu 
260 265 270 

Pro Tyr Ser lie Asp Ser Leu lie Asp Ala Phe Met Gly Phe lie Thr 
275 280 ' 285 

Pro Ala Cys lie Tyr Glu He Cys Cys Trp Cys Ala Tyr Tyr Asn Ser 
290 295 300 

Ala Met Asn Pro Leu He Tyr Ala Leu Phe Tyr Pro Trp Phe Arg Lys 
305 310 315 320 

Ala He Lys Val He Val Thr Gly Gin Val Leu Lys Asn Ser Ser Ala 
325 330 335 

Thr Met Asn Leu Phe Ser Glu His He 
340 345 

<210> 11 
<211> 1020 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 
<400> 11 

atgatgccct tttgccacaa tataattaat atttcctgtg tgaaaaacaa ctggtcaaat 60 
gatgtccgtg cttccctgta cagtttaatg gtgctcataa ttctgaccac actcgttggc 120 
aatctgatag ttattgtttc tatatcacac ttcaaacaac ttcatacccc aacaaattgg 180 
ctcattcatt ccatggccac tgtggacttt cttctggggt gtctggtcat gccttacagt 240 
atggtgagat ctgctgagca ctgttggtat tttggagaag tcttctgtaa aattcacaca 300 
agcaccgaca ttatgctgag ctcagcctcc attttccatt tgtctttcat ctccattgac 360 
cgctactatg ctgtgtgtga tccactgaga tataaagcca agatgaatat cttggttatt 420 
tgtgtgatga tcttcattag ttggagtgtc cctgctgttt ttgcatttgg aatgatcttt 480 
ctggagctaa acttcaaagg cgctgaagag atatattaca aacatgttca ctgcagagga 540 
ggttgctctg tcttctttag caaaatatct ggggtactga cctttatgac ttctttttat 600 
atacctggat ctattatgtt atgtgtctat tacagaatat atcttatcgc taaagaacag 660 
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gcaagattaa 


ttagtgatgc 


caatcagaag 


ctccaaattg gattggaaat gaaaaatgga 


720 


atttcacaaa 


gcaaagaaag 


gaaagctgtg 


aagacattgg ggattgtgat gggagttttc 


780 


ctaatatgct 


ggtgcccttt 


ctttatctgt 


acagtcatgg acccttttct tcactacatt 


840 


attccaccta 


ctttgaatga 


tgtattgatt 


tggtttggct acttgaactc tacatttaat 


900 


ccaatggttt 


atgcattttt 


ctatccttgg 


tttagaaaag cactgaagat gatgctgttt 


960 


ggtaaaattt 


tccaaaaaga 


ttcatccagg 


tgtaaattat ttttggaatt gagttcatag 


1020 



<210> 


12 


<211> 


339 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Novel Sequence 


<400> 


12 



Met Met Pro Phe Cys His Asn He He Asn He Ser Cys Val Lys Asn 
15 10 15 



Asn Trp Ser Asn Asp Val Arg Ala Ser Leu Tyr Ser Leu Met Val Leu 
20 25 30 



He He Leu Thr Thr Leu Val Gly Asn Leu He Val He Val Ser He 
35 40 45 



Ser His Phe Lys Gin Leu His Thr Pro Thr Asn Trp Leu He His Ser 
50 55 60 



Met Ala Thr Val Asp Phe Leu Leu Gly Cys Leu Val Met Pro Tyr Ser 
65 70 75 80 



Met Val Arg Ser Ala Glu His Cys Trp Tyr Phe Gly Glu Val Phe Cys 
85 90 95 



Lys He His Thr Ser Thr Asp He Met Leu Ser Ser Ala Ser He Phe 
100 105 110 



His Leu Ser Phe He Ser He Asp Arg Tyr Tyr Ala Val Cys Asp Pro 
115 120 125 



Leu Arg Tyr Lys Ala Lys Met Asn He Leu Val He Cys Val Met He 
130 135 140 



Phe He Ser Trp Ser Val Pro Ala Val Phe Ala Phe Gly Met He Phe 
145 150 155 160 



Leu Glu Leu Asn Phe Lys Gly Ala Glu Glu He Tyr Tyr Lys His Val 
165 170 175 
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His Cys Arg Gly Gly Cys Ser Val Phe Phe Ser Lys He Ser Gly Val 
180 185 190 



Leu Thr Phe Met Thr Ser Phe Tyr He Pro Gly Ser He Met Leu Cys 
195 200 " 205 



Val Tyr Tyr Arg He Tyr Leu He Ala Lys Glu Gin Ala Arg Leu He 
210 215 220 



Ser Asp Ala Asn Gin Lys Leu Gin He Gly Leu Glu Met Lys Asn Gly 
225 230 235 240 



He Ser Gin Ser Lys Glu Arg Lys Ala Val Lys Thr Leu Gly He Val 
245 250 255 



Met Gly Val Phe Leu He Cys Trp Cys Pro Phe Phe He Cys Thr Val 
260 265 270 



Met Asp Pro Phe Leu His Tyr He He Pro Pro Thr Leu Asn Asp Val 
275 280 285 



Leu He Trp Phe Gly Tyr Leu Asn Ser Thr Phe Asn Pro Met Val Tyr 
290 295 300 



Ala Phe Phe Tyr Pro Trp Phe Arg Lys Ala Leu Lys Met Met Leu Phe 
305 310 315 320 



Gly Lys He Phe Gin Lys Asp Ser Ser Arg Cys Lys Leu Phe Leu Glu 
325 330 335 



Leu Ser Ser 



<210> 13 
<211> 1029 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 
<400> 13 

atgaccagca atttttccca acctgttgtg cagctttgct atgaggatgt gaatggatct 60 
tgtattgaaa ctccctattc tcctgggtcc cgggtaattc tgtacacggc gtttagcttt 120 
gggtctttgc tggctgtatt tggaaatctc ttagtaatga cttctgttct tcattttaag 180 
cagctgcact ctccaaccaa ttttctcatt gcctctctgg cctgtgctga cttcttggta 240 
ggtgtgactg tgatgctttt cagcatggtc aggacggtgg agagctgctg gtattttgga 300 
gccaaatttt gtactcttca cagttgctgt gatgtggcat tttgttactc ttctgtcctc 3 60 
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cacttgtgct tcatctgcat cgacaggtac attgtggtta ctgatcccct ggtctatgct 420 

accaagttca ccgtgtctgt gtcgggaatt tgcatcagcg tgtcctggat tctgcctctc 480 

acgtacagcg gtgctgtgtt ctacacaggt gtcaatgatg atgggctgga ggaattagta 540 

agtgctctca actgcgtagg tggctgtcaa attattgtaa gtcaaggctg ggtgttgata 600 

gattttctgt tattcttcat acctaccctt gttatgataa ttctttacag taagattttt 660 

cttatagcta aacaacaagc tataaaaatt gaaactacta gtagcaaagt agaatcatcc 720 

tcagagagtt ataaaatcag agtggccaag agagagagga aagcagctaa aaccctgggg 780 

gtcacggtac tagcatttgt tatttcatgg ttaccgtata cagttgatat attaattgat 840 

gcctttatgg gcttcctgac ccctgcctat atctatgaaa tttgctgttg gagtgcttat 900 

tataactcag ccatgaatcc tttgatttat gctctatttt atccttggtt taggaaagcc 960 

ataaaactta ttttaagtgg agatgtttta aaggctagtt catcaaccat tagtttattt 1020 

ttagaataa 1029 

<210> 14 

<211> 342 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 14 

Met Thr Ser Asn Phe Ser Gin Pro Val Val Gin Leu Cys Tyr Glu Asp 
1 5 10 15 

Val Asn Gly Ser Cys He Glu Thr Pro Tyr Ser Pro Gly Ser Arg Val 
20 25 30 

He Leu Tyr Thr Ala Phe Ser Phe Gly Ser Leu Leu Ala Val Phe Gly 
35 40 45 

Asn Leu Leu Val Met Thr Ser "Val Leu His Phe Lys Gin Leu His Ser 
50 55 60 

Pro Thr Asn Phe Leu He. Ala Ser Leu Ala Cys Ala Asp Phe Leu Val 
65 70 75 80 

Gly Val Thr Val Met Leu Phe Ser Met Val Arg Thr Val Glu Ser Cys 
85 90 95 

Trp Tyr Phe Gly Ala Lys Phe Cys Thr Leu His Ser Cys Cys Asp Val 
100 105 110 

Ala Phe Cys Tyr Ser Ser Val Leu His Leu Cys Phe He Cys He Asp 
115 120 125 
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Arg Tyr He Val Val Thr Asp Pro Leu Val Tyr Ala Thr Lys Phe Thr 
130 135 140 



Val Ser Val Ser Gly He Cys He Ser Val Ser Trp He Leu Pro Leu 
145 150 155 160 



Thr Tyr Ser Gly Ala Val Phe Tyr Thr Gly Val Asn Asp Asp Gly Leu 
165 170 175 



Glu Glu Leu Val Ser Ala Leu Asn Cys Val Gly Gly Cys Gin He He 
180 185 190 



Val Ser Gin Gly Trp Val Leu He Asp Phe Leu Leu Phe Phe He Pro 
195 200 205 



Thr Leu Val Met He He Leu Tyr Ser Lys He Phe Leu He Ala Lys 
210 215 220 



Gin Gin Ala He Lys He Glu Thr Thr Ser Ser Lys Val Glu Ser Ser 
225 230 235 240 



Ser Glu Ser Tyr Lys He Arg Val Ala Lys Arg Glu Arg Lys Ala Ala 
245 250 255 



Lys Thr Leu Gly Val Thr Val Leu Ala Phe Val He Ser Trp Leu Pro 
260 265 270 



Tyr Thr Val Asp He Leu He Asp Ala Phe Met Gly Phe Leu Thr Pro 
275 280 285 



Ala Tyr He Tyr Glu He Cys Cys Trp Ser Ala Tyr Tyr Asn Ser Ala 
290 295 300 



Met Asn Pro Leu He Tyr Ala Leu Phe Tyr Pro Trp Phe Arg Lys Ala 
305 310 315 " 320 



He Lys Leu He Leu Ser Gly Asp Val Leu Lys Ala Ser Ser Ser Thr 
325 330 335 



He Ser Leu Phe Leu Glu 
340 



<210> 15 

<211> 1062 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 15 
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atggagcaca 


cgcacgccca 


cctcgcagcc 


aacagctcgc 


tgtcttggtg 


gtcccccggc 


60 


tcggcctgcg 


gcttgggttt 


cgtgcccgtg 


gtctactaca 


gcctcttgct 


gtgcctcggt 


120 


ttaccagcaa 


atatcttgac 


agtgatcatc 


ctctcccagc 


tggtggcaag 


aagacagaag 


180 


tcctcctaca 


actatctctt 


ggcactcgct 


gctgccgaca 


tcttggtcct 


ctttttcata 


240 


gtgtttgtgg 


acttcctgtt 


ggaagatttc 


atcttgaaca 


tgcagatgcc 


tcaggtcccc 


300 


gacaagatca 


tagaagtgct 


ggaattctca 


tccatccaca 


cctccatatg gattactgta 


360 


ccgttaacca 


ttgacaggta 


tatcgctgtc 


tgccacccgc 


tcaagtacca cacggtctca 


420 


tacccagccc 


gcacccggaa 


agtcattgta 


agtgtttaca 


tcacctgctt 


cctgaccagc 


480 


atcccctatt 


actggtggcc 


caacatctgg 


actgaagact 


acatcagcac ctctgtgcat 


540 


cacgtcctca 


tctggatcca 


ctgcttcacc 


gtctacctgg 


tgccctgctc 


catcttcttc 


600 


atcttgaact 


caatcattgt 


gtacaagctc 


aggaggaaga gcaattttcg tctccgtggc 


660 


tactccacgg 


ggaagaccac 


cgccatcttg 


ttcaccatta 


cctccatctt 


tgccacactt 


720 


tgggcccccc 


gcatcatcat 


gattctttac 


cacctctatg gggcgcccat 


ccagaaccgc 


780 


tggctggtgc 


acatcatgtc 


cgacattgcc 


aacatgctag 


cccttctgaa 


cacagccatc 


840 


aacttcttcc 


tctactgctt 


catcagcaag 


cggttccgca 


ccatggcagc 


cgccacgctc 


900 


aaggctttct 


tcaagtgcca 


gaagcaacct 


gtacagttct 


acaccaatca 


taacttttcc 


960 


ataacaagta 


gcccctggat 


ctcgccggca 


aactcacact 


gcatcaagat 


gctggtgtac 


1020 


cagtatgaca 


aaaatggaaa 


acctataaaa 


gtatccccgt 


ga 




1062 



<210> 16 
<211> 353 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 
<400> 16 

Met Glu His Thr His Ala His Leu Ala Ala Asn Ser Ser Leu Ser Trp 
15 10 15 

Trp Ser Pro Gly Ser Ala Cys Gly Leu Gly Phe Val Pro Val Val Tyr 
20 25 30 

Tyr t Ser Leu Leu Leu Cys Leu Gly Leu Pro Ala Asn lie Leu Thr Val 
35 40 45 

lie lie Leu Ser Gin Leu Val Ala Arg Arg Gin Lys Ser Ser Tyr Asn 
50 55 60 

Tyr Leu Leu Ala Leu Ala Ala Ala Asp He Leu Val Leu Phe Phe He 
65 70 75 „ 80 
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Val Phe Val Asp Phe Leu Leu Glu Asp Phe He Leu Asn Met Gin Met 
85 90 95 



Pro Gin Val Pro Asp Lys He He Glu Val Leu Glu Phe Ser Ser He 
100 105 110 



His Thr Ser He Trp He Thr Val Pro Leu Thr lie Asp Arg Tyr He 
115 120 125 



Ala Val Cys His Pro Leu Lys Tyr His Thr Val Ser Tyr Pro Ala Arg 
130 135 140 



Thr Arg Lys Val He Val Ser Val Tyr lie Thr Cys Phe Leu Thr Ser 
145 150 155 160 



He Pro Tyr Tyr Trp Trp Pro Asn lie Trp Thr Glu Asp Tyr He Ser 
165 170 175 



Thr Ser Val His His Val Leu lie Trp He His Cys Phe Thr Val Tyr 
180 185 " 190 



Leu Val Pro Cys Ser He Phe Phe lie Leu Asn Ser lie lie Val Tyr 
195 200 205 



Lys Leu Arg Arg Lys Ser Asn Phe Arg Leu Arg Gly Tyr Ser Thr Gly 
210 215 220 



Lys Thr Thr Ala lie Leu Phe Thr He Thr Ser He Phe Ala Thr Leu 
225 230 235 ' 240 



Trp Ala Pro Arg lie lie Met lie Leu Tyr His Leu Tyr Gly Ala Pro 
245 250 255 



lie Gin Asn Arg Trp Leu Val His He Met Ser Asp lie Ala Asn Met 
260 265 270 



Leu Ala Leu Leu Asn Thr Ala lie Asn Phe Phe Leu Tyr Cys Phe He 
275 280 285 



Ser Lys Arg Phe Arg Thr Met Ala Ala Ala Thr Leu Lys Ala Phe Phe 
290 295 300 



Lys Cys Gin Lys Gin Pro Val Gin Phe Tyr Thr Asn His Asn Phe Ser 
305 310 315 320 



lie Thr Ser Ser Pro Trp lie Ser Pro Ala Asn Ser His Cys lie Lys 
325 330 335 



Met Leu Val Tyr Gin Tyr Asp Lys Asn Gly Lys Pro lie Lys Val Ser 
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340 



345 



350 



Pro 



<210> 17 

<211> 969 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 17 



atggatccaa 


ccgtcccagt 


cttcggtaca aaactgacac caatcaacgg 


acgtgaggag 


60 


actccttgct 


acaatcagac 


cctgagcttc 


acggtgctga cgtgcatcat 


ttcccttgtc 


120 


ggactgacag 


gaaacgcggt 


agtgctctgg ctcctgggct accgcatgcg 


caggaacgct 


180 


gtctccatct 


acatcctcaa 


cctggccgca gcagacttcc tcttcctcag cttccagatt 


240 


atacgttcgc 


cattacgcct 


catcaatatc 


agccatctca tccgcaaaat 


cctcgtttct 


300 


gtgatgacct 


ttccctactt 


tacaggcctg agtatgctga gcgccatcag 


caccgagcgc 


360 


tgcctgtctg 


ttctgtggcc 


catctggtac 


cgctgccgcc gccccacaca 


cctgtcagcg 


420 


gtcgtgtgtg 


tcctgctctg 


gggcctgtcc 


ctgctgttta gtatgctgga 


gtggaggttc 


480 


tgtgacttcc 


tgtttagtgg 


tgctgattct 


agttggtgtg aaacgtcaga 


tttcatccca 


540 


gtcgcgtggc 


tgattttttt 


atgtgtggtt 


ctctgtgttt ccagcctggt 


cctgctggtc 


600 


aggatcctct 


gtggatcccg 


gaagatgccg 


ctgaccaggc tgtacgtgac 


catcctgctc 


660 


acagtgctgg 


tcttcctcct 


ctgcggcctg 


cccttcggca ttctgggggc 


cctaatttac 


720 


aggatgcacc 


tgaatttgga 


agtcttatat 


tgtcatgttt atctggtttg 


catgtccctg 


780 


tcctctctaa 


acagtagtgc 


caaccccatc 


atttacttct tcgtgggctc 


ctttaggcag 


840 


cgtcaaaata 


ggcagaacct 


gaagctggtt 


ctccagaggg ctctgcagga 


caagcctgag 


900 


gtggataaag 


gtgaagggca 


gcttcctgag gaaagcctgg agctgtcggg aagcagattg 


960 


gggccatga 










969 



<210> 18 
<211> 322 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 
<400> 18 

Met Asp Pro Thr Val Pro Val Phe Gly Thr Lys Leu Thr Pro He Asn 
15 10 15 

Gly Arg Glu Glu Thr Pro Cys Tyr Asn Gin Thr Leu Ser Phe Thr Val 
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20 



25 



30 



Leu Thr Cys lie lie Ser Leu Val Gly Leu Thr Gly Asn Ala Val Val 
35 40 45 



Leu Trp Leu «Leu Gly Tyr Arg Met Arg Arg Asn Ala Val Ser lie Tyr 
50 55 60 



lie Leu Asn Leu Ala Ala Ala Asp Phe Leu Phe Leu Ser Phe Gin lie 
65 70 75 80 



lie Arg Ser Pro Leu Arg Leu lie Asn lie Ser His Leu lie Arg Lys 
85 90 95 



lie Leu Val Ser Val Met Thr Phe Pro Tyr Phe Thr Gly Leu Ser Met 
100 105 110 



Leu Ser Ala He Ser Thr Glu Arg Cys Leu Ser Val Leu Trp Pro He 
115 120 125 



Trp Tyr Arg Cys Arg Arg Pro Thr His Leu Ser Ala Val Val Cys Val 
130 " 135 140 



Leu Leu Trp Gly Leu Ser Leu Leu Phe Ser Met Leu Glu Trp Arg Phe 
145 150 155 160 



Cys Asp Phe Leu Phe Ser Gly Ala Asp Ser Ser Trp Cys Glu Thr Ser 
165 170 175 



Asp Phe He Pro Val Ala Trp Leu He Phe Leu Cys Val Val Leu Cys 
180 185 190 



Val Ser Ser Leu Val Leu Leu Val Arg He Leu Cys Gly Ser Arg Lys 
195 200 205 



Met Pro Leu Thr Arg Leu Tyr Val Thr He Leu Leu Thr Val Leu Val 
210 215 220 



Phe Leu Leu Cys Gly Leu Pro Phe Gly He' Leu Gly Ala Leu He Tyr 
225 230 235 240 



Arg Met His Leu Asn Leu Glu Val Leu Tyr Cys His Val Tyr Leu Val 
245 250 • 255 



Cys Met Ser Leu Ser Ser Leu Asn Ser Ser Ala Asn Pro He He Tyr 
260 265 270 



Phe Phe Val Gly Ser Phe Arg Gin Arg Gin Asn Arg Gin Asn Leu Lys 
275 280 285 
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Leu Val Leu Gin Arg Ala Leu Gin Asp Lys Pro Glu Val Asp Lys Gly 
290 295 300 

Glu Gly Gin Leu Pro Glu Glu Ser Leu Glu Leu Ser Gly Ser Arg Leu 
305 310 315 320 

Gly Pro 

<210> 19 

<211> 969 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 



<400> 19 



atggattcaa 


ccatcccagt 


cttgggtaca 


gaactgacac 


caatcaacgg acgtgaggag 


60 


actccttgct 


acaagcagac 


cctgagcttc 


acggggctga 


cgtgcatcgt 


ttcccttgtc 


120 


gcgctgacag 


gaaacgcggt 


tgtgctctgg 


ctcctgggct 


gccgcatgcg 


caggaacgct 


180 


gtctccatct 


acatcctcaa 


cctggtcgcg 


gccgacttcc 


tcttccttag 


cggccacatt 


240 


atatgttcgc 


cgttacgcct 


catcaatatc 


cgccatccca 


tctccaaaat 


cctcagtcct 


300 


gtgatgacct 


ttccctactt 


tataggccta 


agcatgctga 


gcgccatcag 


caccgagcgc 


360 


tgcctgtcca 


tcctgtggcc 


catctggtac 


cactgccgcc 


gccccagata 


cctgtcatcg 


420 


gtcatgtgtg 


tcctgctctg 


ggccctgtcc 


ctgctgcgga 


gtatcctgga 


gtggatgttc 


480 


tgtgacttcc 


tgtttagtgg 


tgctgattct 


gtttggtgtg 


aaacgtcaga 


tttcattaca 


540 


atcgcgtggc 


tggttttttt 


atgtgtggtt 


ctctgtgggt 


ccagcctggt 


cctgctggtc 


600 


aggattctct 


gtggatcccg 


gaagatgccg 


ctgaccaggc 


tgtacgtgac 


catcctcctc 


660 


acagtgctgg 


tcttcctcct 


ctgtggcctg 


ccctttggca 


ttcagtgggc 


cctgttttcc 


720 


aggatccacc 


tggattggaa 


agtcttattt 


tgtcatgtgc 


atctagtttc cattttcctg 


780 


tccgctctta 


acagcagtgc 


caaccccatc 


atttacttct 


tcgtgggctc 


ctttaggcag 


840 


cgtcaaaata 


ggcagaacct 


gaagctggtt 


ctccagaggg 


ctctgcagga 


cacgcctgag 


900 


gtggatgaag 


gtggagggtg 


gcttcctcag 


gaaaccctgg 


agctgtcggg 


aagcagattg 


960 


gagcagtga 












969 



<210> 20 

<211> 322 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 20 



Page 21 



Met Asp Ser Thr lie Pro Val Leu Gly Thr Glu Leu Thr Pro lie Asn 
15 10 15 



Gly Arg Glu Glu Thr Pro Cys Tyr Lys Gin Thr Leu Ser Phe Thr Gly 
20 25 30 



Leu Thr Cys lie Val Ser Leu Val Ala Leu Thr Gly Asn Ala Val Val 
35 40 45 



Leu Trp Leu Leu Gly Cys Arg Met Arg Arg Asn Ala Val Ser lie Tyr 
50 55 60 



lie Leu Asn Leu Val Ala Ala Asp Phe Leu Phe Leu Ser Gly His lie 
65 70 75 80 



He Cys Ser Pro Leu Arg Leu He Asn He Arg His Pro He Ser Lys 
85 90 95 



He Leu Ser Pro Val Met Thr Phe Pro Tyr Phe He Gly Leu Ser Met 
100 105 110 



Leu Ser Ala He Ser Thr Glu Arg Cys Leu Ser He Leu Trp Pro He 
115 120 125 



Trp Tyr His Cys Arg Arg Pro Arg Tyr Leu Ser Ser Val Met Cys Val 
130 135 140 



Leu Leu Trp Ala Leu Ser Leu Leu Arg Ser He Leu Glu Trp Met Phe 
145 150 155 160 



Cys Asp Phe Leu Phe Ser Gly Ala Asp Ser Val Trp Cys Glu Thr Ser 
165 170 175 



Asp Phe He Thr He Ala Trp Leu Val Phe Leu Cys Val Val Leu Cys 
180 185 190 



Gly Ser Ser Leu Val Leu Leu Val Arg He Leu Cys Gly Ser Arg Lys 
195 200 205 



Met Pro Leu Thr Arg Leu Tyr Val Thr He Leu Leu Thr Val Leu Val 
210 215 220 



Phe Leu Leu Cys Gly Leu Pro Phe Gly He Gin Trp Ala Leu Phe Ser 
225 230 235 240 



Arg He His Leu Asp Trp Lys Val Leu Phe Cys His Val His Leu Val 
245 250 255 



Ser He Phe Leu Ser Ala Leu Asn Ser Ser Ala Asn Pro He He Tyr 
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260 



265 



270 



Phe 



Phe 



Val Gly Ser Phe Arg Gin Arg Gin Asn Arg Gin Asn Leu Lys 
275 280 285 



Leu 



Val 
290 



Leu Gin Arg Ala Leu Gin Asp Thr Pro Glu Val Asp Glu Gly 
295 300 



Gly 
305 



Gly 



Trp Leu Pro Gin Glu Thr Leu Glu Leu Ser Gly Ser Arg Leu 
310 315 320 



Glu Gin 



<210> 21 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 21 

cagagctctg gtggccacct ctgtcc 26' 



<210> 22 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 



<210> 23 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 23 

gtatgcctgg ccacaatacc tccagg 26 



<210> 24 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence . 



<400> 22 

ctgcgtccac cagagtcacg tctcc 



25 



<400> 24 

gtttgtggct aacggcacaa aacacaattc c 



31 
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<210> 25 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 25 

ggtaccacaa tgacaatcac cagcgtcc 

<210> 26 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 26 

ggaacgtgag gtacatgtgg atgtgcagc 

<210> 27 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 27 

gcagtgtagc ggtcaaccgt gagcagg 

<210> 28 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 28 

tgagcaggat ggcgatccag actgaggcgt gg 

<210> 29 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 29 

gaggtacagc tggcgatgct gacag 

<210> 30 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Novel Sequence 



<400> 30 

gtggccatga gccaccctga gctcc 



25 



<210> 31 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 31 

ggaatgtcca ctgaatgcgc gcgg 24 



<210> 32 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 



<210> 33 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 33 

gcgttatgag cagcaattca tccctgctgg 30 



<210> 34 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 



<210> 35 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 35 

ccctcaggaa tgatgccctt ttgccacaa 29 



<400> 32 

agctcgccag gtgtgagaaa ctcgg 



25 



<400> 34 

gtatcctgaa cttcgtctat acaactgc 



28 
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<210> 36 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 36 

atccatgtgg ttggtgcatg tggttcgt 28 



<210> 37 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 37 

aaacaacaaa cagcagaacc atgaccagc 29 

<210> 38 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 38 

acatagagac aagtgacatg tgtgaaccac 30 



<210> 39 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 39 

ggtatgagac cgtgtggtac ttgagc * 26 



<210> 40 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 40 

gtggcagaca gcgatatacc tgtcaatgg 29 



<210> 41 

<211> 27 

<212> DNA 

<213> Artificial 



Sequence 
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<220> 

<223> Novel Sequence 
<400> 41 

gcgctcatgg agcacacgca cgcccac 

<210> 42 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 42 

gaggcagtag ttgccacacc tatgg 

<210> 43 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 43 

catctggttt gtgttcccag gggcaccag 

<210> 44 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 44 

gacagtgttg ctctcaaagt cccgtctgac tg 

<210> 45 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 45 

ctgtttccag ggtcatcaga ctggg 

<210> 46 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 46 

gcagcattgc tctcaaagtc ctgtctg 



<210> 
<211> 
<212> 
<213> 


47 
6 

PRT 

Artificial Sequence 


<220> 
<223> 


Novel Sequence 


<400> 


47 


Thr Leu Glu Ser lie Met 
1 5 


<210> 
<211> 
<212> 
<213> 


48 
5 

PRT 

Artificial Sequence 


<220> 
<223> 


Novel Sequence 


<400> 


48 


Glu Tyr Asn Leu Val 
1 5 


<210> 
<211> 
<212> 
<213> 


49 
5 

PRT 

Artificial Sequence 


<220> 
<223> 


Novel Sequence 


<400> 


49 


Asp Cys Gly Leu Phe 
1 5 


<210> 
<211> 
<212> 
<213> 


50 
34 
DNA 

Artificial Sequence 


<220> 
<223> 


Novel Sequence 


<400> 


50 



gatcaagctt ccatggcgtg ctgcctgagc gagg 



<210> 51 

<211> 53 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 51 

gatcggatcc ttagaacagg ccgcagtcct tcaggttcag ctgcaggatg gtg 
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<210> 52 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 52 

gtcctcactg gtggccatgt actcc 25 

<210> 53 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 53 

ctgcgtccac cagagtcacg tctcc 25 

<210> 54 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 54 

cttggatgtt tgggctgccc ttctgc 26 

<210> 55 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 55 

gtttgtggct aacggcacaa aacacaattc c 31 

<210> 56 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 56 

ctgctcacgg ttgaccgcta cactgc 26 

<210> 57 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Novel Sequence 
<400> 57 

gtggccatga gccaccctga gctcc 



<210> 58 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 58 

cttcttctcc gacgtcaagg 



<210> 59 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 59 

cttcttctcc gacgtcaagg 



<210> 60 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 60 

cttcttctcc gacgtcaagg 



<210> 61 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 61 

caacggtctg acaacctcct 



<210> 62 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 62 

ttgctgtgat gtggcatttt g 



<210> 63 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 63 

caggaagccc ataaaggcat caa 23 

<210> 64 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 64 

acatcacctg cttcctgacc 20 

<210> 65 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 65 

ccagcatctt gatgcagtgt 20 

<210> 66 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 66 

ccatctccaa aatcctcagt c 21 

<210> 67 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Novel Sequence 

<400> 67 

gctgttaaga gcggacagga aa 22 
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